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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT: Garner, Ian 



Dalrymple, Michael A 
Prunkard, Donna E 
Foster, Donald C 



(ii) TITLE OF INVENTION: Production of Fibrinogen in Transgenic 



(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ZymoGeneti cs , Inc. 

(B) STREET: 4225 Roosevelt Way, N.E. 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98105 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Parker, Gary E 

(B) REGISTRATION NUMBER: 31-648 

(C) REFERENCE/DOCKET NUMBER: 93-15 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-547-8080 ext 322 

(B) TELEFAX: 206-548-2329 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Human Fibrinogen A-alpha chain 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(31..84, 1154.. 1279, 1739.. 1922, 3055.. 3200, 

3786. .5210) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GTCTAGGAGC CAGCCCCACC CTTAGAAAAG ATG TTT TCC ATG AGG ATC GTC TGC 54 

Met Phe Ser Met Arg He Val Cys 
1 5 

CTA GTT CTA AGT GTG GTG GGC ACA GCA TGG GTATGGCCCT TTTCATTTTT 104 
Leu Val Leu Ser Val Val Gly Thr Ala Trp 
10 15 

TCTTCTTGCT TTCTCTCTGG TGTTTATTCC ACAAAGAGCC TGGAGGTCAG AGTCTACCTG 164 

CTCTATGTCC TGACACACTC TTAGCTTTAT GACCCCAGGC CTGGGAGGAA ATTTCCTGGG 224 

TGGGCTTGAC ACCTCAAGAA TACAGGGTAA TATGACACCA AGAGGAAGAT CTTAGATGGA 284 

TGAGAGTGTA CAACTACAAG GGAAACTTTA GCATCTGTCA TTCAGTCTTA CCACATTTTG 344 

TTTTGTTTTG TTTTAAAAAG GGCAAGAATT ATTTGCCATC CTTGTACCTA TAAAGCCTTG 404 

GTGCATTATA ATGCTAGTTA ATGGAATAAA ACATTTTATG GTAAGATTTG TTTTCTTTAG 464 

TTATTAATTT CTTGCTACTT GTCCATAATA AGCAGAACTT TTAGTGTTAG TACAGTTTTG 524 

CTGAAAGGTT ATT6TTGTGT TTGTCAAGAC AGAAGAAAAA GCAAACGAAT TATCTTTGGA 584 



AATATCTTTG CAGTATCAGA AGAGATTAGT TAGTAAG6CA ATACGCTTTT CCGCAGTAAT 644 

GGTATTCTTT TAAATTATGA ATCCATCTCT AAAGGTTACA TAGAAACTTG AAGGAGAGAG 704 

GAACATTCAG TTAAGATAGT CTAGGTTTTT CTACTGAAGC AGCAATTACA GGAGAAAGAG 764 

CTCTACAGTA GTTTTCAACT TTCTGTCTGC AGTCATTAGT AAAAATGAAA AGGTAAAATT 824 

TAACTGATTT TATAGATTCA AATAATTTTC CTTTTAGGAT GGATTCTTTA AAACTCCTAA 884 

TATTTATCAA ATGCTTATTT AAGTGTCACA CACAGTTAAG AAATTTGTAC ACCTTGTCTC 944 

CTTTAATTCT CATAACAACT CCATAAAATG GGTCCTAGGA TTTCCATTTG AAGATAAGAA 1004 

ACCTGAAGCT TGCCGAAGCC CTGTGTCTGC TCTCCTTAAT CTCTGTGAGA GTGCCATCTC 1064 

TTCCTGGGGA CTTGTAGGCA TGCCACTGTC TCCTCTTCTG GCTAACATTG CTGTTGCTCT 1124 

CTTTTGTGTA TGTGAATGAA TCTTTAAAG ACT GCA GAT AGT GGT GAA GGT GAC 1177 

Thr Ala Asp Ser Gly Glu Gly Asp 
20 25 

TTT CTA GCT GAA GGA GGA GGC GTG CGT GGC CCA AGG GTT GTG GAA AGA 1225 
Phe Leu Ala Glu Gly Gly Gly Val Arg Gly Pro Arg Val Val Glu Arg 
30 35 40 

CAT CAA TCT GCC TGC AAA GAT TCA GAC TGG CCC TTC TGC TCT GAT GAA 1273 
His Gin Ser Ala Cys Lys Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu 
45 50 55 

GAC TGG GTAAGCAGTC AGCGGGGGAA GCAGGAGATT CCTTCCCTCT GATGCTAGAG 1329 
Asp Trp 
60 

GGGCTCACAG GCTGACCTGA TTGGTCCCAG AAACTTTTTT AAATAGAAAA TAATTGAATA 1389 

GTTACCTACA TAGCAAATAA AGAAAAGGAA CCTACTCCCA AGAGCACTGT TTATTTACCT 1449 

CCCCAACTCT GGATCATTAG TGGGTGAACA GACAGGATTT CAGTTGCATG CTCAGGCAAA 1509 

ACCAGGCTCC TGAGTATTGT GGCCTCAATT TCCTGGCACC TATTTATGGC TAAGTGGACC 1569 

CTCATTCCAG AGTTTCTCTG CGACCTCTAA CTAGTCCTCT TACCTACTTT TAAGCCAACT 1629 



TATCTGGAAG AGAAAGGGTA GGAAGAAATG GGGGCTGCAT GGAAACATGC AAAATTATTC 1689 

TGAATCTGAG AGATAGATCC TTACTGTAAT TTTCTCCCTT CACTTTCAG AAC TAC 1744 

Asn Tyr 

AAA TGC CCT TCT GGC TGC AGG ATG AAA GGG TTG ATT GAT GAA GTC AAT 1792 
Lys Cys Pro Ser Gly Cys Arg Met Lys Gly Leu He Asp Glu Val Asn 
65 70 75 

CAA GAT TTT ACA AAC AGA ATA AAT AAG CTC AAA AAT TCA CTA TTT GAA 1840 
Gin Asp Phe Thr Asn Arg lie Asn Lys Leu Lys Asn Ser Leu Phe Glu 
80 85 90 

TAT CAG AAG AAC AAT AAG GAT TCT CAT TCG TTG ACC ACT AAT ATA ATG 1888 
Tyr Gin Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn He Met 
95 100 105 110 

GAA ATT TTG AGA GGC GAT TTT TCC TCA GCC AAT A GTAAGTATTA 1932 
Glu He Leu Arg Gly Asp Phe Ser Ser Ala Asn 
115 120 

CATATTTACT TCTTTGACTT TATAACAGAA ACAACAAAAA TCCTAAATAA ATATGATATC 1992 

CGCTTATATC TATGACAATT TCATCCCAAA GTACTTAGTG TAGAAACACA TACCTTCATA 2052 

ATATCCCTGA AAATTTTAAG AGGGAGCTTT TGTTTTCGTT ATTTTTTCAA AGTAAAAGAT 2112 

GTTAACTGAG ATTGTTTAAG GTCACAAAAT AAGTCAGAAT TTTGGATTAA AACAAGAATT 2172 

TAAATGTGTT CTTTTCAACA GTATATACTG AAAGTAGGAT GGGTCAGACT CTTTGAGTTG 2232 

ATATTTTTGT TTCTGCTTTG TAAAGGTGAA AACTGAGAGG TCAAGGAACT TGTTCAAAGA 2292 

CACAGAGCTG GGAATTCAAC TCCCAGACTC CACTGAGCTG ATTAGGTAGA TTTTTAAATT 2352 

TAAAATATAG GGTCAAGCTA CGTCATTCTC ACAGTCTACT CATTAGGGTT AGGAAACATT 2412 

GCATTCACTC TGGGCATGGA CAGCGAGTCT AGGGAGTCCT CAGTTTCTCA AGTTTTGCTT 2472 

TGCCTTTTTA CACCTTCACA AACACTTGAC ATTTAAAATC AGTGATGCCA ACACTAGCTG 2532 

GCAAGTGAGT GATCCTGTTG ACCCAAAACA GCTTAGGAAC CATTTCAAAT CTATAGAGTT 2592 

AAAAAGAAAA GCTCATCAGT AAGAAAATCC AATATGTTCA AGTCCCTTGA TTAAGGATGT 2652 



TATAAAATAA TTGAAATGCA ATCAAACCAA CTATTTTAAC TCCAAATTAC ACCTTTAAAA 2712 

TTCCAAAGAA AGTTCTTCTT CTATATTTCT TTGGGATTAC TAATTGCTAT TAGGACATCT 2772 

TAACTGGCAT TCATGGAAGG CTGCAGGGCA TAACATTATC CAAAAGTCAA ATGCCCCATA 2832 

GGTTTTGAAC TCACAGATTA AACTGTAACC AAAATAAAAT TAGGCATATT TACAAGCTAG 2892 

TTTCTTTCTT TCTTTTTTCT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT 2952 

CTTTCTTTCT TTCTCCTTCC TTCCTTTCTT CCTTTCTTTT TTGCTGGCAA TTACAGACAA 3012 

ATCACTCAGC AGCTACTTCA ATAACCATAT TTTCGATTTC AG AC CGT GAT AAT 3055 

Asn Arg Asp Asn 
125 

ACC TAG AAC CGA GTG TCA GAG GAT CTG AGA AGC AGA ATT GAA GTC CTG 3113 
Thr Tyr Asn Arg Val Ser Glu Asp Leu Arg Ser Arg lie Glu Val Leu 
130 135 140 

AAG CGC AAA GTC ATA GAA AAA GTA CAG CAT ATC CAG CTT CTG CAG AAA 3161 
Lys Arg Lys Val He Glu Lys Val Gin His He Gin Leu Leu Gin Lys 
145 150 155 

AAT GTT AGA GCT CAG TTG GTT GAT ATG AAA CGA CTG GAG GTAAGTATGT 3210 
Asn Val Arg Ala Gin Leu Val Asp Met Lys Arg Leu Glu 
160 165 170 

GGCTGTGGTC CCGAGTGTCC TTGTTTTTGA GTAGAGGGAA AAGGAAGGCG ATAGTTATGC 3270 

ACTGAGTGTC TACTATATGC AGAGAAAAGT GTTATATCCA TCATCTACCT AAAAGTAGGT 3330 

ATTATTTTCC TCACTCCACA GTTGAAGAAA AAAAAATTCA GAGATATTAA GTAAATTTTC 3390 

CAACGTACAT AGATAGTAAT TCAAAGCAAT GTTCAGTCCC TGTCTATTCC AAGCCATTAC 3450 

ATCACCACAC CTCTGAGCCC TCAGCCTGAG TTCACCAAGG ATCATTTAAT TAGCGTTTCC 3510 

TTTGAGAGGG AATAGCACCT TACTCTTGAT CCATTCTGAG GCTAAGATGA ATTAAACAGC 3570 

ATCCATTGCT TATCCTG6CT AGCCCTGCAA TACCCAACAT CTCTTCCACT GAGGGTGCTC 3630 

GATAGGCAGA AAACAGAGAA TATTAAGTGG TAGGTCTCCG AGTCAAAAAA AATGAAACCA 3690 



GTTTCCAGAA GGAAAATTAA CTACCAGGAA CTCAATAGAC GTAGTTTATG TATTTGTATC 3750 

TACATTTTCT CTTTATTTTT CTCCCCTCTC TCTAG GTG GAC ATT GAT ATT AAG 3803 

Val Asp He Asp He Lys 
175 

ATC CGA TCT TGT CGA GGG TCA TGC AGT AGG GCT TTA GCT CGT GAA GTA 3851 

lie Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 
180 185 190 

GAT CTG AAG GAC TAT GAA GAT CAG CAG AAG CAA CTT GAA CAG GTC ATT 3899 

Asp Leu Lys Asp Tyr Glu Asp Gin Gin Lys Gin Leu Glu Gin Val He 
195 200 205 

GCC AAA GAC TTA CTT CCC TCT AGA GAT AGG CAA CAC TTA CCA CTG ATA 3947 

Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gin His Leu Pro Leu He 

210 215 220 

AAA ATG AAA CCA GTT CCA GAC TTG GTT CCC GGA AAT TTT AAG A6C CAG 3995 

Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gin 
225 230 235 240 

CTT CAG AAG GTA CCC CCA GAG TGG AAG GCA TTA ACA GAC ATG CCG CAG 4043 

Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gin 
245 250 255 

ATG AGA ATG GAG TTA GAG AGA CCT GGT GGA AAT GAG ATT ACT CGA GGA 4091 

Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu He Thr Arg Gly 
250 255 270 

GGC TCC ACC TCT TAT GGA ACC GGA TCA GAG ACG GAA AGC CCC AGG AAC 4139 

Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn 
275 280 285 

CCT AGC AGT GCT GGA AGC TGG AAC TCT GGG AGC TCT GGA CCT GGA AGT 4187 

Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser 

290 295 300 

ACT GGA AAC CGA AAC CCT GGG AGC TCT GGG ACT GGA GGG ACT GCA ACC 4235 

Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr 
305 310 315 320 

TGG AAA CCT GGG AGC TCT GGA CCT GGA AGT GCT GGA AGC TGG AAC TCT 4283 

Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser 
325 330 335 



GGG AGC TCT GGA ACT GGA AGT ACT GGA AAC CAA AAC CCT GGG AGC CCT 4331 

Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gin Asn Pro Gly Ser Pro 
340 345 350 

AGA CCT GGT AGT AC C GGA ACC TGG AAT CCT GGC AGC TCT GAA CGC GGA 4379 

Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly 
355 360 365 

AGT GCT GGG CAC TGG ACC TCT GAG AGC TCT GTA TCT GGT AGT ACT GGA 4427 

Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly 

370 375 380 

CAA TGG CAC TCT GAA TCT GGA AGT TTT AGG CCA GAT AGC CCA GGC TCT 4475 

Gin Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser 
385 390 395 400 

GGG AAC GCG AGG CCT AAC AAC CCA GAC TGG GGC ACA TTT GAA GAG GTG 4523 

Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 
405 410 415 

TCA GGA AAT GTA AGT CCA GGG ACA AGG AGA GAG TAC CAC ACA GAA AAA 4571 

Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys 
420 425 430 

CTG GTC ACT TCT AAA GGA GAT AAA GAG CTC AGG ACT GGT AAA GAG AAG 4619 

Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys 
435 440 445 

GTC ACC TCT GGT AGC ACA ACC ACC ACG CGT CGT TCA TGC TCT AAA ACC 4667 

Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr 

450 455 460 

GTT ACT AAG ACT GTT ATT GGT CCT GAT GGT CAC AAA GAA GTT ACC AAA 4715 

Val Thr Lys Thr Val He Gly Pro Asp Gly His Lys Glu Val Thr Lys 
465 470 475 480 

GAA GTG GTG ACC TCC GAA GAT GGT TCT GAC TGT CCC GAG GCA ATG GAT 4763 

Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp 
485 490 495 

TTA GGC ACA TTG TCT GGC ATA GGT ACT CTG GAT GGG TTC CGC CAT AGG 4811 

Leu Gly Thr Leu Ser Gly He Gly Thr Leu Asp Gly Phe Arg His Arg 
500 505 510 



CAC CCT GAT GAA GCT GCC TTC TTC GAC ACT GCC TCA ACT GGA AAA ACA 4859 
His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr 
515 520 525 

TTC CCA GGT TTC TTC TCA CCT ATG TTA GGA GAG TTT GTC AGT GAG ACT 4907 
Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr 
530 535 540 

GAG TCT AGG GGC TCA GAA TCT GGC ATC TTC ACA AAT ACA AAG GAA TCC 4955 
Glu Ser Arg Gly Ser Glu Ser Gly He Phe Thr Asn Thr Lys Glu Ser 
545 550 555 560 

AGT TCT CAT CAC CCT GGG ATA GCT GAA TTC CCT TCC CGT GGT AAA TCT 5003 
Ser Ser His His Pro Gly He Ala Glu Phe Pro Ser Arg Gly Lys Ser 
565 570 575 

TCA AGT TAC AGC AAA CAA TTT ACT AGT AGC ACG AGT TAC AAC AGA GGA 5051 
Ser Ser Tyr Ser Lys Gin Phe Thr Ser Ser Thr Ser Tyr Asn Arg Gly 
580 585 590 

GAC TCC ACA TTT GAA AGC AAG AGC TAT AAA ATG GCA GAT GAG GCC GGA 5099 
Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly 
595 600 605 

AGT GAA GCC GAT CAT GAA GGA ACA CAT AGC ACC AAG AGA GGC CAT GCT 5147 
Ser Glu Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala 
610 615 620 

AAA TCT CGC CCT GTC AGA GGT ATC CAC ACT TCT CCT TTG GGG AAG CCT 5195 
Lys Ser Arg Pro Val Arg Gly He His Thr Ser Pro Leu Gly Lys Pro 
625 630 635 640 

TCC CTG TCC CCC TAGACTAAGT TAAATATTTC TGCACAGTGT TCCCATGGCC 5247 
Ser Leu Ser Pro 

645 

CCTTGCATTT CCTTCTTAAC TCTCTGTTAC ACGTCATTGA AACTACACTT TTTTGGTCTG 5307 

TTTTTGTGCT AGACTGTAAG TTCCTTGGGG GCAGGGCCTT TGTCTGTCTC ATCTCTGTAT 5367 

TCCCAAATGC CTAACAGTAC AGAGCCATGA CTCAATAAAT ACATGTTAAA TGGATGAATG 5427 

AATTCCTCTG AAACTCTATT TGAGCTTATT TAGTCAAATT CTTTCACTAT TCAAAGTGTG 5487 

TGCTATTAGA ATTGTCACCC AACTGATTAA TCACATTTTT AGTATGTGTC TCAGTTGACA 5547 



TTTAGGTCAG GCTAAATACA AGTTGTGTTA GTATTAAGTG AGCTTAGCTA CCTGTACTGG 5507 

TTACTT6CTA TTAGTTTGTG CAAGTAAAAT TCCAAATACA TTTGAGGAAA ATCCCCTTTG 5667 

CAATTTGTAG GTATAAATAA CCGCTTATTT GCATAAGTTC TATCCCACTG TAAGTGCATC 5727 

CTTTCCCTAT GGAGGGAAGG AAAGGAGGAA GAAAGAAAGG AAGGGAAAGA AACAGTATTT 5787 

GCCTTATTTA ATCTGAGCCG TGCCTATCTT TGTAAAGTTA AATGAGAATA ACTTCTTCCA 5847 

ACCAGCTTAA TTTTTTTTTT AGACTGTGAT GATGTCCTCC AAACACATCC TTCAGGTACC 5907 

CAAAGTGGCA TTTTCAATAT CAAGCTATCC GGATCC 5943 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Phe Ser Met Arg He Val Cys Leu Val Leu Ser Val Val Gly Thr 
15 10 15 

Ala Trp Thr Ala Asp Ser Gly Glu Gly Asp Phe Leu Ala Glu Gly Gly 

20 . . 25 30 

Gly Val Arg Gly Pro Arg Val Val Glu Arg His Gin Ser Ala Cys Lys 
35 40 45 

Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu Asp Trp Asn Tyr Lys Cys 
50 55 60 

Pro Ser Gly Cys Arg Met Lys Gly Leu He Asp Glu Val Asn Gin Asp 
65 70 75 80 

Phe Thr Asn Arg He Asn Lys Leu Lys Asn Ser Leu Phe Glu Tyr Gin 
85 90 95 
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Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn lie Met Glu He 
100 105 110 

Leu Arg Gly Asp Phe Ser Ser Ala Asn Asn Arg Asp Asn Thr Tyr Asn 
115 120 125 

Arg Val Ser Glu Asp Leu Arg Ser Arg lie Glu Val Leu Lys Arg Lys 
130 135 140 

Val He Glu Lys Val Gin His He Gin Leu Leu Gin Lys Asn Val Arg 
145 150 155 160 

Ala Gin Leu Val Asp Met Lys Arg Leu Glu Val Asp He Asp He Lys 
165 170 175 

He Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 
180 185 190 

Asp Leu Lys Asp Tyr Glu Asp Gin Gin Lys Gin Leu G1u Gin Val He 
195 200 205 

Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gin His Leu Pro Leu He 
210 215 220 

Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gin 
225 230 235 240 

Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gin 
245 250 255 

Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu He Thr Arg Gly 
260 265 270 

Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn 
275 280 285 

Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser 
290 295 300 



Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr 
305 310 315 320 



Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser 
325 330 335 



• 
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Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gin Asn Pro Gly Ser Pro 
340 345 350 

Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly 
355 360 365 

Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly 
370 375 380 

Gin Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser 
385 390 395 400 

Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 
405 410 415 

Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys 
420 425 430 

Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys 
435 440 445 

Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr 
450 455 460 

Val Thr Lys Thr Val He Gly Pro Asp Gly His Lys Glu Val Thr Lys 
465 470 475 480 

Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp 
485 490 495 

Leu Gly Thr Leu Ser Gly He Gly Thr Leu Asp Gly Phe Arg His Arg 
500 505 510 

His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr 
515 520 525 

Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr 
530 535 540 

Glu Ser Arg Gly Ser Glu Ser Gly He Phe Thr Asn Thr Lys Glu Ser 
545 550 555 560 

Ser Ser His His Pro Gly He Ala Glu Phe Pro Ser Arg Gly Lys Ser 



565 



570 



575 



Ser Ser Tyr Ser Lys 
580 

Asp Ser Thr Phe Glu 
595 

Ser Glu Ala Asp His 
610 

Lys Ser Arg Pro Val 
625 



Gin Phe Thr Ser Ser Thr 
585 

Ser Lys Ser Tyr Lys Met 
600 

Glu Gly Thr His Ser Thr 
615 

Arg Gly lie His Thr Ser 
630 635 



Ser Tyr Asn Arg Gly 
590 

Ala Asp Glu Ala Gly 
605 

Lys Arg Gly His Ala 
620 

Pro Leu Gly Lys Pro 
640 



Ser Leu Ser Pro 



(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen B-beta chain 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 1..469 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 470.. 583 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 584.. 3257 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3258. .3449 
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(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 3450.. 3938 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3939. .4122 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4123.. 5042 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 5043.. 5270 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 5271.. 5830 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 5831.. 5944 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 5945.. 6632 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 6633.. 6758 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 6759.. 6966 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 6967.. 7252 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 7253.. 7870 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 7871.. 8102 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 8103.. 8537 

(ix) FEATURE: 

(A) NAME/KEY: niisc_RNA 

(B) LOCATION: 8538.. 8878 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: joi n{470 . . 583, 3258.. 3449, 3939.. 4122, 
5043. .5270, 

5831. .5944, 6633. .6758, 6967. .7252, 7871. .8102) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GAATTCATGC CCCTTTTGAA ATAGACTTAT GTCATTGTCA GAAAACATAA GCATTTATGG 60 

TATATCATTA ATGAGTCACG ATTTTAGTGG TTGCCTTGTG AGTAGGTCAA ATTTACTAAG 120 

CTTAGATTTG TTTTCTCACA TATTCTTTCG GAGCTTGTGT AGTTTCCACA TTAATTTACC 180 

AGAAACAAGA TACACACTCT CTTTGAGGAG TGCCCTAACT TCCCATCATT TTGTCCAATT 240 

AAATGAATTG AAGAAATTTA ATGTTTCTAA ACTAGACCAA CAAAGAATAA TAGTTGTATG 300 

ACAAGTAAAT AAGCTTTGCT GGGAAGATGT TGCTTAAATG ATAAAATGGT TCAGCCAACA 360 

AGTGAACCAA AAATTAAATA TTAACTAAGG AAAGGTAACC ATTTCTGAAG TCATTCCTAG 420 

CAGAGGACTC AGATATATAT AGGATTGAAG ATCTCTCAGT TAAGTCTAC ATG AAA 475 

Met Lys 
1 

AGG ATG GTT TCT TGG AGC TTC CAC AAA CTT AAA ACC ATG AAA CAT CTA 523 
Arg Met Val Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys His Leu 
5 10 15 

TTA TTG CTA CTA TTG TGT GTT TTT CTA GTT AAG TCC CAA GGT GTC AAC 571 
Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gin Gly Val Asn 



20 25 30 

GAC AAT GAG GAG GTGAATTTTT TAAAGCATTA TTATATTATT AGTAGTATTA 623 
Asp Asn G1u G1u 
35 

TTAATATAAG ATGTAACATA ATCATATTAT GTGCTTATTT TAATGAAATT AGCATTGCTT 683 

ATAGTTATGA AATGGAATTG TTAACCTCTG ACTTATTGTA TTTAAAGAAT GTTTCATAGT 743 

ATTTCTTATA TAAAAACAAA GTAATTTCTT GTTTTCTAGT TTATCACCTT TGTTTTCTTA 803 

AGATGAGGAT GGCTTAGCTA ATGTAAGATG TGTTTTTCTC ACTTGCTATT CTGAGTACTG 853 

TGATTTTCAT TTACTTCTAG CAATACAGGA TTACAATTAA GAGGACAAGA TCTGAAAATC 923 

TCACAAACTA TAAAATAATA AAAGAGCAGA ATTTTAAGAT AAAAGAAACT GGTGGTAGGT 983 

AGATTGTTCT TTGGTGAAGG AAGGTAATAT ATATTGTTAC TGAGATTACT ATTTATAAAA 1043 

ATTATAACTA AGCCTAAAAG CAAAATACAT CAAGTGTAAT GATAGAAAAT GAAATATTGC 1103 

TTTTTTCAGA TGAAAAGTTC AAATTAGAGT TAGTGTGTAT TGTTATTATT AATAGTTATG 1163 

AAACACGGTT CAGTCTAATT TATTTATTTG TAGAACAGTT TGTCCTCAAC TATTATTTTT 1223 

GCTGACTTAT TGCTGTTAAT TTGCAGTTAC TAAAAATACA GAAATGCATT TAGGACAATG 1283 

GATATTTAAG AAATTTAAAT TTTATCATCA AACGTATCAT GGCCAAATTT CTTACATATA 1343 

GCATAGTATC ATTAAACTAG AAATAAGAAT ACACAATAAT ATTTAAATGA AGTGATTCAT 1403 

TTCGGATCAT TATTGAGTTT CAAGGGAACT TGAGTGTTGT ACTTATCAGA CTCTACATGT 1463 

AAGAACATAT AGTTAATCTG GTTGTGTGTG TAAAAACATA TGGTTAATCT GGTTAAGTCT 1523 

GGTTAATCAT ATTAGGTAAG AAAAATGTAA AGAATGTGTA AGACGAAATT TTTGTAAAGT 1583 

ACTCTGCAAA GCACTTTCAC ATTTCTGCTT ATCAACTAAA CCTCACAGAG ATAGTTTAAT 1643 

AGTTTAGGCT TTAAAATGGA TTTTGATTAT TCAACAAGTG GCCTTCATAA TTTCTTTAAG 1703 

TGTTTTTCTT TAAGTATATA CTTTCTTTAA ATATTTTTTA AAATTTCCTT TTCTCTAGTA 1753 

AAGCCAGACC ATCCATGCTA CCTCTCTAGT GGCACTCTGA AATAAAAAGA AAATAGTTTT 1823 




CTCTGTTATA ATTGTATTTG TAATAAGCAG ATGAATCACA TTTCTTAAAA TTTGTTTTAG 1883 

AGAGGGTAAG CTCTGACTAG GACCATGACT TCAATGTGAA ATATGTATAT ATCCTCCGAA 1943 

TCTTTACATA TTAAGAATGT ATATAGTCAA CTGGTTAAAC AGGAAAATCT GGAACAGCCT 2003 

GGCTGGGTTT TAATCTTAGC ACCATCCTAC TAAATGTTAA ATAATATTAT AATCTAATGA 2053 

ATAAATGACA ATGCAATTCC AAATAGAGTT CATCTGATGA CTTCTAGACT CACAAAATTG 2123 

CAAGAGAGCT CAGTTGTTGC TCAGTTGTTC CAAATCATGT CGTTTGTTAA TTTGTAATTA 2183 

AGCTCCAAAG GATGTATAGC TACTGACAAA AAAAAAAATG AGAATGTAGT TAATCCAAAT 2243 

CAAAACTTTC CTATTGCAAT GCGTATTTTC TGCTTCATTA TCCTTTAATA TAATATTTTA 2303 

AGTTAGCAAG TAATTTTAAT TACAATGCAC AAGCCTTGAG AATTATTTTA AATATAAGAA 2353 

AATCATAATG TTTGATAAAG AAATCATGTA AGAAATTTCA AGATAATGGT TTAACAAATA 2423 

ATTTTGTTGA TAGAAGATAA GACTAAAAGT GAAATTCGAA GTGGAGAGGA CACTTAAACT 2483 

GTAGTACTTG TTATGTGTGA TTCCAGTAAA AATAGTAATG AGCACTTATT ATTGCCAAGT 2543 

ACTGTTCTGA GGGTACCATA TGCAATAAGT TATTTAATCC TTACAATAAT CTTGTAAG6C 2603 

AGATTCAAAC TATCATTACA CTTATTTTAC AGATGAGAAA ACTGGGGCAC AGATAAAGCA 2663 

ACTTGCCCAA GGTCTCATAG CTGTAAGTCA ACCCTACGGT CAAGACCTAC AAGTAGCCGA 2723 

GCTCGAGAGT AGATTATGAG GGTCAAAGAT TGTCTTATTA CAAATAAATT CCAAGTAGAA 2-783 

TCAACCTTTA ATAAGTCTTT AATGTCTCTT AAATATGTTT ATATAGGAGT CTAATCACCA 2843 

ATTCACAAAA ATGAAAGTAG GGAAATGATT AACAATAATC ATAGGAATCT AACAATCCAA 2903 

GTGGCTTGAG AATATTCATT CTTCTTGACA GTATAGATTC TTTACAATTT CGTAAGTTCC 2963 

AATGTATGTT TTAGGAATAT GAGGTCATTA CTATTCATAA TCTGATACAG CTTTATCCTA 3023 

AGGCCTCTCT TTAAAAACTA CACTGCATCA TAGCTTTTTT GTGCAGTTGG TCTTTCTACT 3083 

GTTACTGAAC AGTAAGCAAC CTACAGATTC ACTATCACCA ACCAGCCAGT TGATGGATCT 3143 
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TAAGCAAATT ATCAAGCTTG TGATAACCTA AATTATAAAA TGAGGGTGTT GGAATAGTTA 3203 

CATTCCAAAT CTTCTATAAC ACTCTGTATT ATATTTCTGC CTCATTCCTT GTAG GGT 3250 

Gly 



TTC TTC AGT GCC CGT GGT CAT CGA CCC CTT GAC AAG AAG AGA GAA GAG 3308 
Phe Phe Ser Ala Arg Gly His Arg Pro Leu Asp Lys Lys Arg Glu Glu 
40 45 50 55 

GCT CCC AGC CTG AGG CCT GCC CCA CCG CCC ATC AGT GGA GGT GGC TAT 3356 
Ala Pro Ser Leu Arg Pro Ala Pro Pro Pro lie Ser Gly Gly Gly Tyr 
60 65 70 

CGG GCT CGT CCA GCC AAA GCA GCT GCC ACT CAA AAG AAA GTA GAA AGA 3404 
Arg Ala Arg Pro Ala Lys Ala Ala Ala Thr Gin Lys Lys Val Glu Arg 
75 80 85 

AAA GCC CCT GAT GCT GGA GGC TGT CTT CAC GCT GAC CCA GAC CTG 3449 
Lys Ala Pro Asp Ala Gly Gly Cys Leu His Ala Asp Pro Asp Leu 
90 95 100 

GTGGGTGCAC TGATGTTTCT TGCAGTGGTG GCTCTCTCAT GCAGAGAAAG CCTGTAGTCA 3509 

TGGCAGTCTG CTAATGTTTC ACTGACCCAC ATTACCATCA CTGTTATTTT GTTTGTTTAT 3569 

TTTGGAAATA AAATTCAAAA CATAAACATA TTGGGCCTTT GGTTTAGGCT TTCTTTCTTG 3629 

TTTTCTTTGG TCTGGGCCCA AAATTTCAAA TTAGGATATG TGGGTGCCAC CTTTCCATTT 3689 

GTATTTTGCC ACTGCCTTTG TTTAGTTGGT AAAATTTTCA TAGCCCAATT ATATTTTTTC 3749 

TGGGGTAAGT AATATTTTAA ATCTCTATGA GAGTATGATG ATGACTTTCG AATTTCTGGT 3809 

CTTACAGAAA ACCAAATAAT AAATTTTTAT GTTGGCTAAT CGTATCGCTG AATTTTCCTA 3859 

TGTGCTATTT TAACAAATGT CCATGACCCA AATCCTTCAT CTAATGCCTG CTATTTTCTT 3929 

TGTTTTTAG GGG GTG TTG TGT CCT ACA GGA TGT CAG TTG CAA GAG GCT 3977 
Gly Val Leu Cys Pro Thr Gly Cys Gin Leu Gin Glu Ala 
105 110 115 



TTG CTA CAA CAG GAA AGG CCA ATC AGA AAT AGT GTT GAT GAG TTA AAT 
Leu Leu Gin Gin Glu Arg Pro lie Arg Asn Ser Val Asp Glu Leu Asn 
120 125 130 



4025 




AAC AAT GTG GAA GCT GTT TCC CAG ACC TCC TCT TCT TCC TTT CAG TAG 4073 
Asn Asn Val Glu Ala Val Ser Gin Thr Ser Ser Ser Ser Phe Gin Tyr 
135 140 145 

ATG TAT TTG GTG AAA GAG CTG TGG CAA AAG AGG CAG AAG CAA GTA AAA G 4122 
Met Tyr Leu Leu Lys Asp Leu Trp Gin Lys Arg Gin Lys Gin Val Lys 
150 155 160 

GTAGATATCC TTGTGCTTTC CATTCGATTT TCAGCTATAA AATTGGAACC GTTAGACTGC 4182 

CACGAGAATG CATGGTTGTG AGAAGATTAA CATTTCTGGG TTAGTGAATA GCATTCATAC 4242 

GCTTTTGGGC ACCTTCCCCT GCAACTTGCC AGATAAGCAC TATTCAGCTC TTATTCCCAG 4302 

TCTGACATCA GCAAGTGTGA TTTTCTATGA AAAATTCTAC TATGACTCCT TATTTTAAGT 4362 

ATACAA6AAA CTTGTGACTC AGAAGATAAT ATTTACAGAG TGGAAAAAAA CCCCTAGCAT 4422 

TTATAGTTTT AACATTTGAG GTTTTGAATG AGAGAGTTAT CCATAATATA TTCAATTGTG 4482 

TTGTGGATAA TGACACCTAA CCTGTGAATC TTGAGGTCAG AATGTTGAGT GCTGTTGACT 4542 

TGGTGGTCAG GAAACAGCTA GTGCGTGAGC CTGGCACAGG CATCTCAGTG AGTAGCATAC 4602 

CCACAGTTGG AAATTTTTCA AAGAAATCAA AGGAATCATG ACATCTTATA AATTTCAAGG 4662 

TTCTGCTATA CTTATGTGAA ATGGATAAAT AAATCAAGCA TATCCACTCT GTAAGATTGA 4722 

ACTTCTCAGA TGGAAGACCC CAATACTGCT TTCTCCTCTT TTCCCTCACC AAAGAAATAA 4782 

ACAACGTATT TCATTTATTA CTGGACACAA TCTTTAGCGT ATACCTATGG TAAATTACTA 4842. 

GTATG6TGGT TAGGATTTAT GTTAATTTGT ATATGTCATG CGCCAAATCA TTTCCACTAA 4902 

ATATGACTAT ATATCATAAC TGCTTGGTGA TAGCTCAGTG TTTAATAGTT TATTCTCAGA 4962 

AAATCAAAAT TGTATAGTTA AATACATTAG TTTTATGAGG CAAAAATGCT AACTATTTCT 5022 

ACATAATTTC ATTTTTCCAG AT AAT GAA AAT GTA GTG AAT GAG TAG TCC 5071 

Asp Asn Glu Asn Val Val Asn Glu Tyr Ser 
165 170 

TCA GAA CTG GAA AAG CAC CAA TTA TAT ATA GAT GAG ACT GTG AAT AGC 5119 
Ser Glu Leu Glu Lys His Gin Leu Tyr He Asp Glu Thr Val Asn Ser 



175 180 185 

AAT ATC CCA ACT AAC CTT CGT GTG CTT CGT TCA ATC CTG GAA AAC CTG 5167 
Asn He Pro Thr Asn Leu Arg Val Leu Arg Ser lie Leu Glu Asn Leu 
190 195 200 205 

AGA AGC AAA ATA CAA AAG TTA GAA TCT GAT GTC TCA GCT CAA ATG GAA 5215 
Arg Ser Lys He Gin Lys Leu Glu Ser Asp Val Ser Ala Gin Met Glu 
210 215 220 

TAT TGT C6C ACC CCA TGC ACT GTC AGT TGC AAT ATT CCT GTG GTG TCT 5263 
Tyr Cys Arg Thr Pro Cys Thr Val Ser Cys Asn He Pro Val Val Ser 
225 230 235 

GGC AAA G GTAACTGATT CATAAACATA TTTTTAGAGA GTTCCAGAAG AACTCACACA 5320 
Gly Lys 



CCAAAAATAA GAGAACAACA ACAACAACAA AAATGCTAAG TGGATTTTCC CAACAGATCA 5380 

TAATGACATT ACAGTACATC ATAAAAATAT CCTTAGCCAG TTGTGTTTTG GACTGGCCTG 5440 

GTGCATTTGC TGGTTTTGAT GAGCAGGATG GGGCACAGGT AGTCCCAGGG GTGGCTGATG 5500 

TGTGCATCTG CGTACTGGCT TGAACAGATG GCAGAACCAC AGATAGATGT AGAAGTTTCT 5560 

CCATTTTGTG TGTTCTGGGA GCTCATGGAT ATTCCAGGAC ACAAAAGGTG GAGAAGAGCT 5620 

TTGTTCATCC TCTTAGCAGA TAAACGTCCT CAAAACTGGG TTGGACTTAC TAAAGTAAAA 5680 

TGAAAATCTA ATATTTGTTA TATTATTTTC AAAGGTCTAT AATAACACAC TCCTTAGTAA 5740 

CTTATGTAAT GTTATTTTAA AGAATTGGTG ACTAAATACA AAGTAATTAT GTCATAAACC 5800 

CCTGAACATA ATGTTGTCTT ACATTTGCAG AA TGT GAG GAA ATT ATC AGG AAA 5853 

Glu Cys Glu Glu He He Arg Lys 
240 245 

GGA GGT GAA ACA TCT GAA ATG TAT CTC ATT CAA CCT GAC AGT TCT GTC 5901 
Gly Gly Glu Thr Ser Glu Met Tyr Leu He Gin Pro Asp Ser Ser Val 
250 255 250 

AAA CCG TAT AGA GTA TAC TGT GAC ATG AAT ACA GAA AAT GGA G 5944 
Lys Pro Tyr Arg Val Tyr Cys Asp Met Asn Thr Glu Asn Gly 
265 270 275 




GTAAGCTTTC GACAGTTGTT GACCTGTTGA TCTGTAATTA TTTGGATACC GTAAAATGCC 5004 

AGGAAACAAG GCCAGGTGTG GTGGCTCATA CCTGTAATTC CAGCACCTTG GGAGGCCAAA 6064 

GTG6GCTGAT AGCTTGAGCC TAGGAGTTTG AAACTAGCCT GGGCAACATA ATGAGACCCT 6124 

AACTCTACAA AAAAAAAAAA AATACCAAAA AAAAAAAAAA AATCA6CTGT GTTGGTAGTA 5184 

TGTGCCTGTA GTCCCA6CTA TCCAGGAGGC TGAGATGGGA GATCACCT6A GCCCACAACC 6244 

TGGAGTCTTG ATCATGCTAC TGAACTGTAG CCTGGGCAAC AGAGGATAGT GAGATCCTGT 5304 

CTCAAAAAAA AAAATTAATT AAAAAGCCAG GAAACAAGAC TTAGCTCTAA CATCTAACAT 5364 

AGCTGACAAA GGAGTAATTT GATGTGGAAT TCAACCTGAT ATTTAAAAGT TATAAAATAT 5424 

CTATAATTCA CAATTTGGGG TAAGATAAAG CACTTGCAGT TTCCAAAGAT TTTACAAGTT 6484 

TACCTCTCAT ATTTATTTCC TTATTGTGTC TATTTTAGAG CACCAAATAT ATACTAAATG 5544 

GAATGGACAG GGGATTCAGA TATTATTTTC AAAGTGACAT TATTTGCTGT TGGTTAATAT 6604 

ATGCTCTTTT TGTTTCTGTC AACCAAAG GA TGG ACA GTG ATT CAG AAC CGT 6655 

Gly Trp Thr Val lie Gin Asn Arg 
280 285 

CAA GAC GGT AGT GTT GAC TTT GGC AGG AAA TGG GAT CCA TAT AAA CAG 6703 
Gin Asp Gly Ser Val Asp Phe Gly Arg Lys Trp Asp Pro Tyr Lys Gin 
290 295 300 

GGA-TTT GGA AAT GTT GCA ACC AAC ACA GAT GGG AAG AAT TAC TGT GGC 6751 
Gly Phe Gly Asn Val Ala Thr Asn Thr Asp Gly Lys Asn Tyr Cys Gly 
305 310 315 

CTA CCA G GTAACGAACA GGCATGCAAA ATAAAATCAT TCTATTTGAA ATGGGATTTT 6808 
Leu Pro 

TTTTAATTAA AAAACATTCA TTGTTGGAAG CCTGTTTTAG GCAGTTAAGA GGAGTTTCCT 6868 

GACAAAAATG TGGAAGCTAA AGATAAGGGA AGAAAGGCAG TTTTTAGTTT CCCAAAATTT 5928 

TATTTTTGGT GAGAGATTTT ATTTTGTTTT TCTTTTAG GT GAA TAT TGG CTT 6980 

Gly Glu Tyr Trp Leu 



320 

GGA AAT GAT AAA ATT AGC CAG CTT ACC AGG ATG GGA CCC ACA GAA CTT. 7028 
Gly Asn Asp Lys lie Ser Gin Leu Thr Arg Met Gly Pro Thr Glu Leu 
325 330 335 340 

TTG ATA GAA ATG GAG GAC TGG AAA GGA GAC AAA GTA AAG GCT CAC TAT 7076 
Leu lie Glu Met Glu Asp Trp Lys Gly Asp Lys Val Lys Ala His Tyr 
345 350 355 

GGA GGA TTC ACT GTA CAG AAT GAA GCC AAC AAA TAC CAG ATC TCA GTG 7124 
Gly Gly Phe Thr Val Gin Asn Glu Ala Asn Lys Tyr Gin He Ser Val 
360 365 370 

AAC AAA TAC AGA GGA ACA GCC GGT AAT GCC CTC ATG GAT GGA GCA TCT 7172 
Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met Asp Gly Ala Ser 
375 380 385 

CAG CTG ATG GGA GAA AAC AGG ACC ATG ACC ATT CAC AAC GGC ATG TTC 7220 
Gin Leu Met Gly Glu Asn Arg Thr Met Thr He His Asn Gly Met Phe 
390 395 400 

TTC AGC ACG TAT GAC AGA GAC AAT GAC GGC TG GTATGTGTGG 7262 
Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly Trp 
405 410 415 

CACTCTTTGC TCCTGCTTTA AAAATCACAC TAATATCATT ACTCAGAATC ATTAACAATA 7322 

TTTTTAATAG CTACCACTTC CTGGGCACTT ACTGTCAGCC ACTGTCCTAA GCTCTTTATG 7382 

CATCACTCGA AAGCATTTCA ACTATAAGGT AGACATTCTT ATTCTCATTT TACAGATGAG 7442 

ATTTAGAGAG ATTACGTGAT TTGTCCAATG TCACACAACT ACCCAGAGAT AAAACTAGAA 7502 

TTTGAGCACA GTTACTTTCT GAATAATGAG CATTTAGATA AATACCTATA TCTCTATATT 7552 

CTAAAGTGTG TGTGAAAACT TTCATTTTCA TTTCCAGGGT TCTCTGATAC TAAGGGTTGT 7622 

AAAAGCTATT ATTCCAGTAT AAAGTAACAA ACACAGTCCC TAGATGGATT GCCACAAAGG 7682 

CCCAGTTATC TCTCTTTCTT GCTATAGGGC ACAGGAGGTC TTTGGTGTAT TAGTGTGACT 7742 

CTATGTATAG CACCCAAAGG AAAGACTACT GTGCACACGA GTGTAGCAGT CTTTTATGGG 7802 

TAATCTGCAA AACGTAACTT GACCACCGTA GTTCTGTTTC TAATAACGCC AAACACATTT 7862 



TCTTTCAG G TTA ACA TCA GAT CCC AGA AAA CAG TGT TCT AAA GAA GAC 7910 
Leu Thr Ser Asp Pro Arg Lys Gin Cys Ser Lys Glu Asp 
420 425 

GGT GGT GGA TGG TGG TAT AAT AGA TGT CAT GCA GCC AAT CCA AAC GGC 7958 
Gly Gly Gly Trp Trp Tyr Asn Arg Cys His Ala Ala Asn Pro Asn Gly 
430 435 440 

AGA TAC TAC TGG GGT GGA CAG TAC ACC TGG GAC ATG GCA AAG CAT GGC 8006 
Arg Tyr Tyr Trp Gly Gly Gin Tyr Thr Trp Asp Met Ala Lys His Gly 
445 450 455 460 

ACA GAT GAT GGT GTA GTA TGG ATG AAT TGG AAG GGG TCA TGG TAC TCA 8054 
Thr Asp Asp Gly Val Val Trp Met Asn Trp Lys Gly Ser Trp Tyr Ser 
465 470 475 

ATG AGG AAG ATG AGT ATG AAG ATC AGG CCC TTC TTC CCA CAG CAA TAGTCCCCAA 
8109 

Met Arg Lys Met Ser Met Lys lie Arg Pro Phe Phe Pro Gin Gin 
480 485 490 

TACGTAGATT TTTGCTCTTC TGTATGTGAC AACATTTTTG TACATTATGT TATTGGAATT 8169 

TTCTTTCATA CATTATATTC CTCTAAAACT CTCAAGCAGA CGTGAGTGTG ACTTTTTGAA 8229 

AAAAGTATAG GATAAATTAC ATTAAAATAG CACATGATTT TCTTTTGTTT TCTTCATTTC 8289 

TCTTGCTCAC CCAAGAAGTA ACAAAAGTAT AGTTTTGACA GAGTTGGTGT TCATAATTTC 8349 

AGTTCTAGTT GATTGCGAGA ATTTTCAAAT AAGGAAGAGG GGTCTTTTAT CCTTGTCGTA 8409 

GGAAAACCAT GACGGAAAGG AAAAACTGAT GTTTAAAAGT CCACTTTTAA AACTATATTT 8469 

ATTTATGTAG GATCTGTCAA AGAAAACTTC CAAAAAGATT TATTAATTAA ACCAGACTCT 8529 

GTTGCAATAA GTTAATGTTT TCTTGTTTTG TAATCCACAC ATTCAATGAG TTAGGCTTTG 8589 

CACTTGTAAG GAAGGAGAAG CGTTCACAAC CTCAAATAGC TAATAAACCG 6TCTTGAATA 8649 

TTTGAAGATT TAAAATCTGA CTCTAGGACG GGCACGGTGG CTCACGACTA TAATCCCAAC 8709 

ACTTTGGGAG GCTGAGGCGG GCGGTCACAA GGTCAGGAGT TCAAGACCAG CCTGACCAAT 8769 

ATGGTGAAAC CCCATCTCTA CTAAAAATAC AAAAATTAGC CAGGCGTGGT GGCAGGTGCC 8829 



TGTAGGTCCC AGCTAGCCTG TGAGGTGGAG ATTGCATTGA GCCAAGATC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Lys Arg Met Val Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys 
15 10 15 

His Leu Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gin Gly 
20 25 30 

Val Asn Asp Asn Glu Glu Gly Phe Phe Ser Ala Arg Gly His Arg Pro 
35 40 45 

Leu Asp Lys Lys Arg Glu Glu Ala Pro Ser Leu Arg Pro Ala Pro Pro 
50 55 60 

Pro He Ser Gly Gly Gly Tyr Arg Ala Arg Pro Ala Lys Ala Ala Ala 
65 70 75 80 

Thr Gin Lys Lys Val Glu Arg Lys Ala Pro Asp Ala Gly Gly Cys Leu 
85 90 95 ■ 

His Ala Asp Pro Asp Leu Gly Val Leu Cys Pro Thr Gly Cys Gin Leu 
100 105 110 

Gin Glu Ala Leu Leu Gin Gin Glu Arg Pro He Arg Asn Ser Val Asp 
115 120 125 

Glu Leu Asn Asn Asn Val Glu Ala Val Ser Gin Thr Ser Ser Ser Ser 
130 135 140 

Phe Gin Tyr Met Tyr Leu Leu Lys Asp Leu Trp Gin Lys Arg Gin Lys 
145 150 155 160 




57 



Gin Val Lys Asp Asn Glu Asn Val Val Asn Glu Tyr Ser Ser Glu Leu 
165 170 175 

Glu Lys His Gin Leu Tyr He Asp Glu Thr Val Asn Ser Asn He Pro 
180 185 190 

Thr Asn Leu Arg Val Leu Arg Ser He Leu Glu Asn Leu Arg Ser Lys 
195 200 205 

He Gin Lys Leu Glu Ser Asp Val Ser Ala Gin Met Glu Tyr Cys Arg 
210 215 220 

Thr Pro Cys Thr Val Ser Cys Asn He Pro Val Val Ser Gly Lys Glu 
225 230 235 240 

Cys Glu Glu He He Arg Lys Gly Gly Glu Thr Ser Glu Met Tyr Leu 
245 250 255 

He Gin Pro Asp Ser Ser Val Lys Pro Tyr Arg Val Tyr Cys Asp Met 
260 265 270 

Asn Thr Glu Asn Gly Gly Trp Thr Val He Gin Asn Arg Gin Asp Gly 
275 280 285 

Ser Val Asp Phe Gly Arg Lys Trp Asp Pro Tyr Lys Gin Gly Phe Gly 
290 295 300 

Asn Val Ala Thr Asn Thr Asp Gly Lys Asn Tyr Cys Gly Leu Pro Gly 
305 310 315 320 

Glu Tyr Trp Leu Gly Asn Asp Lys He Ser Gin Leu Thr Arg Met Gly 
325 330 335 

Pro Thr Glu Leu Leu He Glu Met Glu Asp Trp Lys Gly Asp Lys Val 
340 345 350 

Lys Ala His Tyr Gly Gly Phe Thr Val Gin Asn Glu Ala Asn Lys Tyr 
355 360 365 



Gin He Ser Val Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met 
370 375 380 



Asp Gly Ala Ser Gin Leu Met Gly Glu Asn Arg Thr Met Thr He His 
385 390 395 400 



Asn Gly Met Phe Phe Ser Thr Tyr 
405 

Thr Ser Asp Pro Arg Lys Gin Cys 
420 

Trp Tyr Asn Arg Cys His Ala Ala 

435 440 

Gly Gly Gin Tyr Thr Trp Asp Met 
450 455 

Val Val Trp Met Asn Trp Lys Gly 
465 470 

Ser Met Lys lie Arg Pro Phe Phe 
485 



Asp Arg Asp Asn Asp Gly Trp Leu 
410 415 

Ser Lys Glu Asp Gly Gly Gly Trp 
425 430 

Asn Pro Asn Gly Arg Tyr Tyr Trp 
445 

Ala Lys His Gly Thr Asp Asp Gly 
460 

Ser Trp Tyr Ser Met Arg Lys Met 
475 480 

Pro Gin Gin 
490 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen gamma chain 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1799. . 1876, 1973.. 2017, 2207.. 2390, 2510 

..2603, 4211.. 4341, 4645.. 4778, 5758.. 5942, 7426 
..7703, 9342.. 9571) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
CTACACACTT CTTGAAGGCA AAGGCAATGC TGAAGTCACC TTTCATGTTC AAATCATATT 60 



AAAAAGTTAG CAAGATGTAA TTATCAGTGT ACTATGTAAA TCTTTGTGAA TGATCAATAA 120 



TTACATATTT TCATTATATA TATTTTAGTA GATAATATTT ATATACATTC AACATTCTAA 180 

ATATAGAAAG TTTACAGAGA AAAATAAAGC CTTTTTTTCC AATCCTGTCC TCCACCTCTG 240 

CATCCCATTC TTCTTCACAG AGGCAACTGA TTCAAGTCAT TACATAGTTA TTGA6TGTTA 300 

ACTACAACTA TGTTAAGTAC AGCTATATAT GTTAGATGCC GTAGCCACAG AAATCAGTTT 360 

ACAATCTAAT GCAGTGGATA CAGCATGTAT ACATATAATA TAAG6TTGCT ACAAATGCTA 420 

TCTGAGGTAG AGCTGTTTGA AAGAATACTA ATACTTAAAT GTTTAATTCA ACTGACTTGA 480 

TTGACAACTG ATTAGCTGAG TGGAAAAGAT GGATGAGAAA GATTGTGAGA CTTAATTGGC 540 

TGGTGGTATG GTGATATGAT TGACAATAAC TGCTAAGTCA GAGAGGGATA TATTAAGGAG 600 

GAGAAGAAAA GCAACAAATC T6GTTTTGAT GTGTTCACTT TGTTATAATT ATTGATTATT 660 

TACTGAATAT GAATATTTAT CTTTGTTTTT GAGTCAATAA ATATACCTTT GTAAAGACAG 720 

AATTAAAGTA TTAGTATTTC TTTCAAACTG GAGGCATTTC TCCCACTAAC ATATTTCATC 780 

AAAACTTATA ATAAGCTTGG TTCCAGAGGA AGAAATGAGG GATAACCAAA AATAGAGACA 840 

TTAATAATA6 TGTAACGCCC AGTGATAAAT CTCAATAGGC AGTGATGACA GACATGTTTT 900 

CCCAAACACA AGGATGCTGT AAGGGCCAAA CAGAAATGAT GGCCCCTCCC CAGCACCTCA 960 

TTTTGCCCCT TCCTTCAGCT ATGCCTCTAC TCTCCTTTAG ATACAAGGGA GGTGGATTTT 1020 

TCTGTTGTCT GAGATAGCTT GATGGAACCA CAGGAACAAT GAAGTGGGCT CCTGGCTCTT 1080 

TTCTCTGTGG CAGATGGGGT GCCATGCCCA CCTTCAGACA AAGGGAAGAT TGAGCTCAAA 1140 

AGCTCCCTGA GAAGTGAGAG CCTATGAACA TGGTTGACAC AGAGGGACAG GAATGTATTT 1200 

CCAGGGTCAT TCATTCCTGG GAATAGTGAA CTGGGACATG G6GGAAGTCA GTCTCCTCCT 1260 

GCCACAGCCA CAGATTAAAA ATAATAATGT TAACTGATCC CTAGGCTAAA ATAATAGTGT 1320 

TAACTGATCC CTAAGCTAAG AAAGTTCTTT TGGTAATTCA GGTGATGGCA GCAGGACCCA 1380 

TCTTAAGGAT AGACTAGGTT TGCTTAGTTC GAGGTCATAT CTGTTTGCTC TCAGCCATGT 1440 



ACTGGAAGAA GTTGCATCAC ACAGCCTCCA GGACTGCCCT CCTCCTCACA GCAATGGATA 1500 

ATGCTTCACT AGCCTTTGCA GATAATTTTG GATCAGAGAA AAAACCTTGA GCTGGGCCAA 1560 

AAAGGAGGAG CTTCAACCTG TGTGCAAAAT CTGGGAACCT GACAGTATAG GTTGGGGGCC 1620 

AGGATGAGGA AAAAGGAACG GGAAAGACCT GCCCACCCTT CTGGTAAGGA GGCCCCGTGA 1680 

TCAGCTCCAG CCATTTGCAG TCCTGGCTAT CCCAGGAGCT TACATAAAGG GACAATTGGA 1740 

GCCTGAGAGG TGACAGTGCT GACACTACAA GGCTCGGAGC TCCGGGCACT CAGACATC 1798 

ATG AGT TGG TCC TTG CAC CCC CGG AAT TTA ATT CTC TAC TTC TAT GCT 1846 
Met Ser Trp Ser Leu His Pro Arg Asn Leu He Leu Tyr Phe Tyr Ala 
15 10 15 

CTT TTA TTT CTC TCT TCA ACA TGT GTA GCA GTAAGTGTGC TCTTCACAAA 1895 
Leu Leu Phe Leu Ser Ser Thr Cys Val Ala 
20 25 

ACGTTGTTTA AAATGGAAAG CTGGAAAATA AAACAGATAA TAAACTAGTG AAATTTTCGT 1956 

ATTTTTTCTC TTTTAG TAT GTT GCT ACC AGA GAC AAC TGC TGC ATC TTA 2005 

Tyr Val Ala Thr Arg Asp Asn Cys Cys lie Leu 
30 35 

GAT GAA AGA TTC GTAAGTAGTT TTTATGTTTC TCCCTTTGTG TGTGAACTGG 2057 
Asp Glu Arg Phe 
40 

AGAGGGGCAG AGGAATAGAA ATAATTCCCT CATAAATATC ATCTGGCACT TGTAACTTTT 2117 

TAAAAACATA GTCTAGGTTT TACCTATTTT TCTTAATAGA TTTTAAGAGT AGCATCTGTC 2177 

TACATTTTTA ATCACTGTTA TATTTTCAG GGT AGT TAT TGT CCA ACT ACC TGT 2230 

Gly Ser Tyr Cys Pro Thr Thr Cys 
45 

GGC ATT GCA GAT TTC CTG TCT ACT TAT CAA ACC AAA GTA GAC AAG GAT 2278 
Gly He Ala Asp Phe Leu Ser Thr Tyr Gin Thr Lys Val Asp Lys Asp 
50 55 60 65 

CTA CAG TCT TTG GAA GAC ATC TTA CAT CAA GTT GAA AAC AAA ACA TCA 2326 
Leu Gin Ser Leu Glu Asp He Leu His G1n Val Glu Asn Lys Thr Ser 
70 75 80 




GAA GTC AAA CAG CTG ATA AAA GCA ATC CAA CTC ACT TAT AAT CCT GAT 2374 
Glu Val Lys Gin Leu He Lys Ala He Gin Leu Thr Tyr Asn Pro Asp 
85 90 95 

GAA TCA TCA AAA CCA A GTGA6AAAAT AAAGACTACT GACCAAAAAA 2420 
Glu Ser Ser Lys Pro 
100 

TAATAATAAT AATCTGTGAA GTTCTTTTGC TGTTGTTTTA GTTGTTCTAT TTGCTTAAGG 2480 

ATTTTTATGT CTCTGATCCT ATATTACAG AT ATG ATA GAC GCT GCT ACT TTG 2532 

Asn Met He Asp Ala Ala Thr Leu 
105 110 

AAG TCC AGG ATA ATG TTA GAA GAA ATT ATG AAA TAT GAA GCA TCG ATT 2580 
Lys Ser Arg He Met Leu Glu Glu He Met Lys Tyr Glu Ala Ser He 
115 120 125 

TTA ACA CAT GAC TCA AGT ATT CG GTAAGGATTT TTGTTTTAAT TTGCTCTGCA 2633 
Leu Thr His Asp Ser Ser He Arg 
130 

AGACTGATTT AGTTTTTATT TAATATTCTA TACTTGAGTG AAAGTAATTT TTAATGTGTT 2693 

TTCCCCATTT ATAATATCCC AGTGACATTA TGCCTGATTA TGTTGAGCAT AGTAGAGATA 2753 

GAAGTTTTTA GTGCAATATA AATTATACTG GGTTATAATT GCTTATTAAT AATCACATTG 2813 

AAGAAAGATG TTCTAGATGT CTTCAAATGC TAGTTTGACC ATATTTATCA AAAATTTTTT 2873 

CCCCATCCCC CATTTATCTT ACAACATAAA ATCAATCTCA TAGGAATTTG GGTGTTGAAA 2933 

ATAAAATCCT CTTTATAAAA ATGCTGACAA ATTGGTGGTT AAAAAAATTA GCAAGCAGAG 2993 

GCATA6TAAG GATTTTGGCT CCTAAAGTAA ATTATATTGA ATGTGGAGCA GGAAGAAACA 3053 

TGTCTTGA6A GACTAAGTGT 6GCAAATATT GCAAAGCTCA TATTGATCAT TGCAGAATGA 3113 

ACCTGCATAG TCTCTTCCCT TCATTTGGAA GTGAATGTCT CTGTTAAAGC TTCTCAGGGA 3173 

CTCATAAACT TTCTGAACAT AAGGTCTCAG ATACAGTTTT AATATTTTTC CCCAATTTTT 3233 



TTTTCTGAAT TTTTCTCAAA GCAGCTTGAG AAATTGAGAT AAATAGTAGC TAGGGAGAAG 3293 




TGGCCCAGGA AAGATTTCTC CTCTTTTTGC TATCAGAGGG CCCTTGTTAT TATTGTTATT 3353 

ATTATTACTT GCATTATTAT TGTCCATCAT TGAAGTTGAA GGAGGTTATT GTACAGAAAT 3413 

TGCCTAAGAC AAGGTAGAGG GAAAACGTGG ACAAATAGTT TGTCTACCCT TTTTTACTTC 3473 

AAAGAAAGAA CGGTTTATGC ATTGTAGACA GTTTTCTATC ATTTTTGGAT ATTTGCAAGC 3533 

CACCCTGTAA GTAACTACAA AAGGAGGGTT TTTACTTCCC CCAGTCCATT CCCAAAGCTA 3593 

TGTAACCAGA AGCATTAAAG AAGAAAGGGG AAGTATCTGT TGTTTTATTT TACATACAAT 3653 

AACGTTCCAG ATCATGTCCC TGTGTAAGTT ATATTTTAGA TTGAAGCTTA TATGTATAGC 3713 

CTCAGTAGAT CCACAAGTGA AAGGTATACT CCTTCAGCAC ATGTGAATTA CTGAACTGAG 3773 

CTTTTCCTGC TTCTAAAGCA TCAGGGGGTG TTCCTATTAA CCAGTCTCGC CACTCTTGCA 3833 

GGTTGCTATC TGCTGTCCCT TATGCATAAA GTAAAAAGCA AAATGTCAAT GACATTTGCT 3893 

TATTGACAAG GACTTTGTTA TTTGTGTTGG GAGTTGAGAC AATATGCCCC ATTCTAAGTA 3953 

AAAAGATTCA GGTCCACATT GTATTCCTGT TTTAATTGAT TTTTTGATTT GTTTTTCTTT 4013 

TTCAAAAAGT TTATAATTTT AATTCATGTT AATTTAGTAA TATAATTTTA CATTTTCCTC 4073 

AAGAATGGAA TAATTTATCA GAAAGCACTT CTTAAGAAAA TACTTAGCAG TTTCCAAAGA 4133 

AAATATAAAA TTACTCTTCT GAAAGGAATA CTTATTTTTG TCTTCTTATT TTTGTTATCT 4193 

TATGTTTCTG TTTGTAG A TAT TTG CAG GAA ATA TAT AAT TCA AAT AAT CAA 4244 

Tyr Leu Gin Glu lie Tyr Asn Ser Asn Asn Gin 
135 140 145 

AAG ATT GTT AAC CTG AAA GAG AAG GTA GCC CAG CTT GAA GCA CAG TGC 4292 
Lys He Val Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin Cys 
150 155 160 

CAG GAA CCT TGC AAA GAC ACG GTG CAA ATC CAT GAT ATC ACT GGG AAA G 4341 
Gin Glu Pro Cys Lys Asp Thr Val Gin He His Asp lie Thr Gly Lys 
165 170 175 

GTAACTGATG AAGGTTATAT TGGGATTAGG TTCATCAAAG TAAGTAATGT AAAGGAGAAA 4401 

GTATGTACTG GAAAGTATAG GAATAGTTTA GAAAGTGGCT ACCCATTAAG TCTAAGAATT 4461 




TCAGTTGTCT AGACCTTTCT TGAATAGCTA AAAAAAACAG TTTAAAAGGA ATGCTGATGT 4521 

GAAAAGTAAG AAAATTATTC TTGGAAAATG AATAGTTTAC TACATGTTAA AAGCTATTTT 4581 

TCAAGGCTGG CACAGTCTTA CCTGCATTTC AAACCACAGT AAAAGTCGAT TCTCCTTCTC 4641 

TAG AT TGT CAA GAC ATT GCC AAT AAG GGA GCT AAA CAG AGC GGG CTT 4688 
Asp Cys Gin Asp He Ala Asn Lys Gly Ala Lys Gin Ser Gly Leu 
180 185 190 

TAG TTT ATT AAA CCT CTG AAA GCT AAC CAG CAA TTC TTA GTC TAC TGT 4736 
Tyr Phe He Lys Pro Leu Lys Ala Asn Gin Gin Phe Leu Val Tyr Cys 
195 200 205 

GAA ATC GAT GGG TCT GGA AAT GGA TGG ACT GTG TTT CAG AAG 4778 
Glu He Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys 
210 215 220 

GTAATTTTTT CCCCACCATG TGTATTTAAT AAATTCCTAC ATTGTTTCTG CCATATGGCA 4838 

GATACTTTTC TAAGCACCTT GTGAACCGTA GCTCATTTAA TCCTTGCAAT AGCCCTAAGA 4898 

GGAAGGTACT TCTGTTACTC CTATTTACAG AAAAGGAAAC TGAGGCACAC AAGGTTAAAT 4958 

AACTTGCCCA AGACCACATA ACTAATAAGC AACAGAGTCA GCATTTGAAC CTAGGCAGTA 5018 

TAGTTTCAGA GTTTGTGACT TGACTCTATA TTGTACTGGC ACTGACTTTG TAGATTCATG 5078 

GTGGCACATA ATCATAGTAC CACAGTGACA AATAAAAAGA AGGAAACTCT TTTGTCAGGT 5138 

AGGTCAAGAC CTGAGGTTTC CCATCAGAAG ATGAGGAAGC CCAACACCAC CCCCCACCAC 5198 

CCCACCACCA TCACCACCCT TTCACACACC AGAGGATACA CTTGGGCTGC TCCAAGACAA 5258 

GGAACCTGTG TTGCATCTGC CACTTGCTGA TACCCACTAG GAATCTTGGC TCCTTTACTT 5318 

TCTGTTTACC TCCCACCACT GTTATAACTG TTTCTACAGG GGGCGCTCAG AGGGAATGAA 5378 

TGGTGGAAGC ATTAGTTGCC AGACACCGAT TGAGCAATGG GTTCCATCAT AAGTGTAAGA 5438 

ATCAGTAATA TCCAGCTAGA GTTCTGAAGT CGTCTAGGTG TCTTTTTAAT ATTACCACTC 5498 

ATTTAGAATT TATGATGTGC CAGAAACCCT CTTAAGTATT TCTCTTATAT TCTCTCTCAT 5558 




GATCCTTGCA GCAACCCTAA GAAGTAACCA TCATTTTTCC TATTTGATAC ATGAGGAAAC 5618 

TGAGGTAGCT TGGCCAAGAT CACTTAGTTG GGAGTTGATA GAACCAGTGC TCTGTATTTT 5678 

TGACAAAATG TTGACAGCAT TCTCTTTACA TGCATTGATA GTCTATTTTC TCCTTTTGCT 5738 

CTTGCAAATG TGTAATTAG AGA CTT GAT GGC AGT GTA GAT TTC AAG AAA AAC 5790 

Arg Leu Asp Gly Ser Val Asp Phe Lys Lys Asn 
225 230 

TGG ATT CAA TAT AAA GAA GGA TTT GGA CAT CTG TCT CCT ACT GGC ACA 5838 
Trp He Gin Tyr Lys Glu Gly Phe Gly His Leu Ser Pro Thr Gly Thr 
235 240 245 

ACA GAA TTT TGG CTG GGA AAT GAG AAG ATT CAT TTG ATA AGC ACA CAG 5886 
Thr Glu Phe Trp Leu Gly Asn Glu Lys He His Leu He Ser Thr Gin 
250 255 260 265 

TCT GCC ATC CCA TAT GCA TTA AGA GTG GAA CTG GAA GAC TGG AAT GGC 5934 
Ser Ala He Pro Tyr Ala Leu Arg Val Glu Leu Glu Asp Trp Asn Gly 
270 275 280 

AGA ACC AG GTACTGTTTT GAAATGACTT CCAACTTTTT ATTGTAAAGA 5982 
Arg Thr Ser 

TTGCCTGGAA TGTGCACTTT CCAACTATCA ATAGACAATG GCAAATGCAG CCTGACAAAT 6042 

GCAAACAGCA CATCCAGCCA CCATTTTCTC CAGGAGTCTG TTTGGTTCTT GGGCAATCCA 6102 

AAAAGGTAAA TTCTATTCAG GATGAATCTA AGTGTATTGG TACAATCTAA TTACCCTGGA 6162 

ACCATTCAGA GTAATAGCTA ATTACTGAAC TTTTAATCAG TCCCAGGAAT TGAGCATAAA 6222 

ATTATAATTT TATCTAGTCT AAATTACTAT TTCATGAAGC AGGTATTATT ATTAATCCCA 6282 

TTTTATAGAT TAACTTGCTC AAAGTCACAT TGCTGATAAG TGGTAGAGGT AGAATTCAGA 6342 

CTCAAGTAGT TTAACTTTAG AGCCTGTCCT CTTAACAACT ATCCTGGTTG AAAAGCAAAT 6402 

ACAGCCTCTT CAGACTTCTC AGTGCCTTGA TGGCCATTTA TTCTGTCAAA TCATGAGCTA 6462 

CCCTAAAAGT AAACCAGCTA GCTCTTTTGA TGATCTAGAG 6CTTCTTTTT GCTTGAGATA 6522 

TTTGAAGGTT TTAAGCATTG TTACCTAATT AAAATGCAGA AAAATATCCA ACCCTCTTGT 6582 




TATGTTTAAG 6AATAGTGAA ATATATTGTC TTCAAACACA TGGACTTTTT TTTATTGCTT 6642 

GGTTGGTTTT TAATCCAGAA AGTGCTATAG TCAGTAGACC TTCTTCTAGG AAAGGACCTT 6702 

CCATTTCCCA GCCACTGGAG ATTAGAAAAT AAGCTAAATA TTTTCTGGAA ATTTCTGTTC 6762 

ATTCATTAAG GCCCATCCTT TCCCCCACTC TATAGAAGTG TTGTCCACTT GCACAATTTT 6822 

TTCCAGGAAA GAATCTCTCT AACTCCTTCA GCTCACATGC TTTGGACCAC ACAGGGAAGA 6882 

CTTTGATTGT GTAATGCCCT CAGAAGCTCT CCTTCTTGCC ACTACCACAC TGATTTGAGG 6942 

AAGAAAATCC CTTTAGCACC TAACCCTTCA GGTGCTATGA GTGGCTAATG GAACTGTACC 7002 

TCCTTCAAGT TTTGTGCAAT AATTAAGGGT CACTCACTGT CAGATACTTT CTGTGATCTA 7062 

TGATAATGTG TGTGCAACAC ATAACATTTC AATAAAAGTA GAAAATATGA AATTAGAGTC 7122 

ATCTACACAT CTGGATTTGA TCTTAGAATG AAACAAGCAA AAAAGCATCC AAGTGAGTGC 7182 

AATTATTAGT TTTCAGAGAT GCTTCAAAGG CTTCTAGGCC CATCCCGGGA AGTGTTAATG 7242 

AGCTGTGGAC TGGTTCACAT ATCTATTGCC TCTTGCCAGA TTTGCAAAAA ACTTCACTCA 7302 

ATGAGCAAAT TTCA6CCTTA AGAAACAAAG TCAAAAATTC CAAGGAAGCA TCCTACGAAA 7362 

GAGGGAACTT CTGAGATCCC TGAGGAGGGT CAGCATGTGA TGGTTGTATT TCCTTCTTCT 7422 

CAG T ACT GCA GAC TAT GCC ATG TTC AAG GTG GGA CCT GAA GCT GAC 7468 
Thr A1a Asp Tyr Ala Met Phe Lys Val Gly Pro Glu Ala Asp 
285 290 295 

AAG TAC CGC CTA ACA TAT GCC TAC TTC GCT GGT GGG GAT GCT GGA GAT 7516 
Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp 
300 305 310 

GCC TTT GAT GGC TTT GAT TTT GGC GAT GAT CCT AGT GAC AAG TTT TTC 7564 
Ala Phe Asp Gly Phe Asp Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe 
315 320 325 330 



ACA TCC CAT AAT GGC ATG CAG TTC AGT ACC TGG GAC AAT GAC AAT GAT 
Thr Ser His Asn Gly Met Gin Phe Ser Thr Trp Asp Asn Asp Asn Asp 
335 340 345 
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AAG TTT GAA GGC AAC TGT GCT GAA CAG GAT GGA 
Lys Phe Glu Gly Asn Cys Ala Glu Gin Asp Gly 
350 355 



TCT GGT TGG TGG ATG 
Ser Gly Trp Trp Met 
360 
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AAC AAG TGT CAC GCT GGC CAT CTC AAT GGA GTT 
Asn Lys Cys His Ala Gly His Leu Asn Gly Val 
365 370 



TAT TAC CAA G 
Tyr Tyr Gin 
375 
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GTATGTTTTC CTTTCTTAGA TTCCAAGTTA ATGTATAGTG TATACTATTT TCATAAAAAA 7763 

TAATAAATAG ATATGAAGAA ATGAAGAATA ATTTATAAAG ATAGTAGGGA TTTTATCATG 7823 

TTCTTTATTT CAACTAAGTT CTTTGAAACT GGAAGTGGAT AATACCAAGT TCATGCCTAA 7883 

AATTAGCCCT TCTAAAGAAA TCCACCTGCT GCAAAATATC CAGTAGTTTG GCATTATATG 7943 

TGAAACTATC ACCATCATAG CTGGCACTGT GGGTTGTGGG ATCTCCTTTA GACATACAAC 8003 

ATAAATGATC TGGATGGATT AACATTACTA CATGGATGCT TGTTGACACA TTAACCTGGC 8063 

TTCCCATGAG CTTTGTGTCA GATACACGCA GTGAACAGGT GTTTGGAGGA ACAGAATAAA 8123 

GAGAAGGCAA GCACTGGTAA GGGCAGGGGT TTGTGAAAGC TTGAGAGAAG AGACCAGTCT 8183 

GAGGACAGTA GACACTTATT TTAGGATGGG GGTTGGATGA GGAGGCTATA GTTTGCTATA 8243 

AGCTTGGAAT GGTTTGGAAC ACTGGTTTCA CTCACCTACC CAGCAGTTAT GTGTGGGGAA 8303 

GCCTTACCGA TGCTAAAGGA TCCATGTTAC AATAATGGCA TTATTTGGAA ATCCCAGTGG 8363 

TATTCCATGA ATAAAACCAC TATGAAGATA ATCCCACTCA ACAGACTCTC CGTTGGAGAA 8423 

GGACAGCAAC ACCACCCTGG GAAAGCCAAA CAGTCAGACC AGACCTGTTT AGCATCAGTA 8483 

GGACTTCCCT ACCATATCTG CTGGGTAGAT GAGTGAAACC AGTGTTCCAA ACCACTCCGG 8543 

GCTTGTAGCA AACCATAGTC TCCTCATCTA CCAAGATGAG CAACCTTACC TCCTGATGTC 8603 

CTAGCCAATC ACCAACTAGG AAACTTTGCA CAGTTTATTT AAAGTAACAG TTTGATTTTC 8563 

ACAATATTTT TAAATTGGAG AAACATAACT TATCTTTGCA CTCACAAACC ACATAATGAG 8723 

AAGAAACTCT AAGGGAAAAT GCTTGATCTG TGTGACCCGG GGCGCCATGC CAGAGCTGTA 8783 

GTTCATGCCA GTGTTGTGCT CTGACAAGCC TTTTACAGAA TTACATGAGA TCTGCTTCCC 8843 




TAGGACAAGG AGAAGGCAAA TCAACAGAGG CTGCACTTTA AAATGGAGAC ATAAAATAAC 8903 

AT6CCAGAAC CATTTCCTAA AGCTCCTCAA TCAACCAACA AAATTGTGCT TTCAAATAAC 8963 

CTGAGTTGAC CTCATCAGGA ATTTTGTGGC TCCTTCTCTT CTAACCTGCC TGAAGAAAGA 9023 

TGGTCCACAG CAGCTGAGTC CGGGATGGAT AAGCTTAGGG ACAGAGGCCA ATTAGGGAAC 9083 

TTTGGGTTTC TAGCCCTACT AGTAGTGAAT AAATTTAAAG TGTGGATGTG ACTATGAGTC 9143 

ACAGCACAGA TGTTGTTTAA TAATATGTTT ATTTTATAAA TTGATATTTT AGGAATCTTT 9203 

GGAGATATTT TCAGTTAGCA GATAATACTA TAAATTTTAT GTAACTGGCA ATGCACTTCG 9263 

TAATAGACAG CTCTTCATAG ACTTGCAGAG GTAAAAAGAT TCCAGAATAA TGATATGTAC 9323 

ATCTACGACT T6TTTTAG GT GGC ACT TAG TCA AAA GCA TCT ACT CCT AAT 9373 

Gly Gly Thr Tyr Ser Lys Ala Ser Thr Pro Asn 
380 385 

GGT TAT GAT AAT GGC ATT ATT TGG GCC ACT TGG AAA ACC CGG TGG TAT 9421 
Gly Tyr Asp Asn Gly lie He Trp Ala Thr Trp Lys Thr Arg Trp Tyr 
390 395 400 

TCC ATG AAG AAA ACC ACT ATG AAG ATA ATC CCA TTC AAC AGA CTC ACA 9469 
Ser Met Lys Lys Thr Thr Met Lys He He Pro Phe Asn Arg Leu Thr 
405 410 415 

ATT GGA GAA GGA GAG CAA CAC CAC CTG GGG GGA GCC AAA CAG GTC AGA 9517 
He Gly Glu Gly Gin Gin His His Leu Gly Gly Ala Lys Gin Val Arg 
420 425 430 435 

CCA GAG CAC CCT GCG GAA ACA GAA TAT GAC TCA CTT TAG CCT GAG GAT 9565 
Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser Leu Tyr Pro Glu Asp 
440 445 450 

GAT TTG TAGAAAATTA ACTGCTAACT TCTATTGACC CACAAAGTTT CAGAAATTCT 9621 
Asp Leu 



CTGAAAGTTT CTTCCTTTTT TCTCTTACTA TATTTATTGA TTTCAAGTCT TCTATTAAGG 9681 



ACATTTAGCC TTCAATGGAA ATTAAAACTC ATTTAGGACT GTATTTCCAA ATTACTGATA 9741 
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TCAGAGTTAT TTAAAAATTG TTTATTTGAG GAGATAACAT TTCAACTTTG TTCCTAAATA 
TATAATAATA AAATGATTGA CTTTATTTGC ATTTTTATGA CCACTTGTCA TTTATTTTGT 
CTTCGTAAAT TATTTTCATT ATATCAAATA TTTTAGTATG TACTTAATAA AATAGGAGAA 
CATTTTAGAG TTTCAAATTC CCAGGTATTT TCCTTGTTTA TTACCCCTAA ATCATTCCTA 
TTTAATTCTT CTTTTTAAAT GGAGAAAATT AT6TCTTTTT AATATGGTTT TTGTTTTGTT 
ATATATTCAC AGGCTGGAGA CGTTTAAAAG ACCGTTTCAA AAGAGATTTA CTTTTTTAAA 
GGACTTTATC TGAACAGAGA GATATAATAT TTTTCCTATT GGACAATGGA CTTGCAAAGC 
TTCACTTCAT TTTAAGAGCA AAAGACCCCA TGTTGAAAAC TCCATAACAG TTTTATGCTG 
ATGATAATTT ATCTACATGC ATTTCAATAA ACCTTTTGTT TCCTAAGACT AGATACATGG 
TACCTTTATT GACCATTAAA AAACCACCAC TTTTTGCCAA TTTACCAATT ACAATTGGGC 
AACCATCAGT AGTAATTGAG TCCTCATTTT ATGCTAAATG TTATGCCTAA CTCTTTGGGA 
GTTACAAAGG AAATAGCAAT TATGGCTTTT GCCCTCTAGG AGATACAGGA CAAATACAGG 
AAAATACAGC AACCCAAACT GACAATACTC TATACAAGAA CATAATCACT AAGCAGGAGT 
CACAGCCACA CAACCAAGAT GCATAGTATC CAAAGTGCAG CTG 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Ser Trp Ser Leu His Pro Arg Asn Leu lie Leu Tyr Phe Tyr Ala 
15 10 15 

Leu Leu Phe Leu Ser Ser Thr Cys Val A1a Tyr Val Ala Thr Arg Asp 



9801 
9851 
9921 
9981 
10041 
10101 
10161 
10221 
10281 
10341 
10401 
10461 
10521 
10564 



25 



30 
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Asn Cys Cys lie Leu Asp Glu Arg Phe Gly Ser Tyr Cys Pro Thr Thr 
35 40 45 

Cys Gly He Ala Asp Phe Leu Ser Thr Tyr Gin Thr Lys Val Asp Lys 
50 55 60 

Asp Leu Gin Ser Leu Glu Asp lie Leu His Gin Val Glu Asn Lys Thr 
65 70 75 80 

Ser Glu Val Lys Gin Leu He Lys Ala He Gin Leu Thr Tyr Asn Pro 
85 90 95 

Asp Glu Ser Ser Lys Pro Asn Met He Asp Ala Ala Thr Leu Lys Ser 
100 105 110 

Arg He Met Leu Glu Glu He Met Lys Tyr Glu Ala Ser He Leu Thr 
115 120 125 

His Asp Ser Ser He Arg Tyr Leu Gin Glu He Tyr Asn Ser Asn Asn 
130 . 135 140 

Gin Lys He Val Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin 
145 150 155 160 

Cys Gin Glu Pro Cys Lys Asp Thr Val Gin He His Asp He Thr Gly 
165 170 175 

Lys Asp Cys Gin Asp He Ala Asn Lys Gly Ala Lys Gin Ser Gly Leu 
180 185 190 

Tyr Phe He Lys Pro Leu Lys Ala Asn Gin Gin Phe Leu -Val Tyr Cys 
195 200 205 

Glu He Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys Arg Leu 
2,10 215 220 

Asp Gly Ser Val Asp Phe Lys Lys Asn Trp He Gin Tyr Lys Glu Gly 
225 230 235 240 



Phe Gly His Leu Ser Pro Thr Gly Thr Thr Glu Phe Trp Leu Gly Asn 
245 250 255 



Glu Lys He His Leu He Ser Thr Gin Ser Ala He Pro Tyr Ala Leu 
260 265 270 
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Arg Val Glu Leu Glu Asp Trp Asn Gly Arg Thr Ser Thr Ala Asp Tyr 
275 280 285 

Ala Met Phe Lys Val Gly Pro Glu Ala Asp Lys Tyr Arg Leu Thr Tyr 
290 295 300 

Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp Ala Phe Asp Gly Phe Asp 
305 310 315 320 

Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe Thr Ser His Asn Gly Met 
325 330 335 

Gin Phe Ser Thr Trp Asp Asn Asp Asn Asp Lys Phe Glu Gly Asn Cys 
340 345 350 

Ala Glu Gin Asp Gly Ser Gly Trp Trp Met Asn Lys Cys His Ala Gly 
355 360 365 

His Leu Asn Gly Val Tyr Tyr Gin Gly Gly Thr Tyr Ser Lys Ala Ser 
370 375 380 

Thr Pro Asn Gly Tyr Asp Asn Gly He He Trp Ala Thr Trp Lys Thr 
385 390 395 400 

Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys lie He Pro Phe Asn 
405 410 415 

Arg Leu Thr He Gly Glu Gly Gin Gin His His Leu Gly Gly Ala Lys 
420 425 430 

Gin Val Arg Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser Leu Tyr 
435 440 445 

Pro Glu Asp Asp Leu 
450 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: ovine beta-lactoglobul in 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

ACGCGTGTCG ACCTGCAGGT CAACGGATCT CTGTGTCTGT TTTCATGTTA GTACCACACT 60 

GTTTTGGTGG CTGTAGCTTT CAGCTACAGT CTGAAGTCAT AAAGCCTGGT ACCTCCAGCT 120 

CTGTTCTCTC TCAAGATTGT GTTCTGCTGT TTGGGTCTTT AGTGTCTCCA CACAATTTTT 180 

AGAATTGTTT GTTCTAGTTC TGTGAAAAAT GATGCTGGTA TTTTGATAAG GATTGCATTG 240 

AATCTGTAAA GCTACAGATA TAGTCATTGG GTAGTACAGT CACTTTAACA ATATTAACTC 300 

TTCACATCTG TGAGCATGAT ATATTTTCCC CCTCTATATC ATCTTCAATT CCTCCTATCA 360 

GTTTCTTTCA TTGCAGTTTT CTGAGTACAG GTCTTACACC TCCTTGGTTA GAGTCATTCC 420 

TCAGTATTTT ATTCCTTTGA TACAATTGTG AATGAGGTAA TTTTCTTAGT TTCTCTTTCT 480 

GATAGCTCAT TGTTAGTGTA TATATAGAAA AGCAACAGAT TTCTATGTAT TAATTTTGTA 540 

TCCTGCAACA GATTTCTATG TATTAATTTT GTATCCTGCT ACTTTACGGA ATTCACTTAT 600 

TAGCTTTTTG GTGACATCTT GAGGATTTTC TGAAGAAAAT GGCATGGTAT GGTAGGACAA 660 

GGTGTCATGT CATCTGCAAA CAGTGGCAGT TTTCCTTCTT CCCTTCCAAC CTGGATTTCT 720 

TTGATTTCTT TCTGTCTGAG TACGACTAGG ATTCCCAATA CTATACCGAA TAAAAGTGGC 780 

AAGA6TGGAC ATCCTTGTCT TATTTTTCTG ACCTTAGAGG AAATGCTTTC AGTTTTTCAC 840 

CATTAATTAT AATGTTTACT GTGGGCTTGT CATATGTGGC CTTCATTATA TGGAGGTCTA 900 

TTCCCTCTAT ACCCACCTTG TTGA6AGTTT TTATCATAAA AGTATGTTGA ATTTTGTCAA 960 

AAGTTTTTCC TGCATCTATT GAGATGATTT TTACTCTTCA ATTCATTAAT GATTTTTATT 1020 

CTTCATTTTG TTAATGATTT CCATTCTTCA ATTTGTTAAC GTGGTATATC ACATTGATTG 1080 

ATTTGTGGAT ACCTTTGTAT CCCTGGGATA AACCTCACTT GATCATGAGC TTTCAATGTA 1140 
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TTTTTGAATT CACTTTGCTA ATATTCTGTT GGGTATTTTT GCATCTCTAT TCATCAATGA 
TATTGGCCTA AGAAAGGTTT TGTCTGGTTT TAGTATCAGG GTGATGCTGG CCTCATAGAG 
AGAGTTTAGA AGCATTTCCT CCTCTTTGAT TTTTC6GAAT AGTTTGAGTA GGATAGGTAT 
TAACTCTTCT TTAAATGTTT GGGGACTTCC CTGGT6AGCC GGTGGTTGAG AATCCGCCTC 
AGGGATGT6G GTTTGATCCC TGGTCAGGGA ACCATTAATA AGATCCCACA TGCTGCAGGC 
AACAAGCCCC CAAGCTGCAA CCACTGAGCT GCAACCGCTG CAGTGCCCAC AGGCCACGAC 
CAGAGAAAGC CCACATACAG CAGGGAAGAC CCAGCACAAC CGGAAAAAGG AGTTTGGTGG 
AATACAGCTG TGAAGCCGTC TGGTCCTGGA CTCCTGCTTG AGGGAATTTT TTAAAAATTA 
TTGATTCAAT TTCATTACTG GTAACTGGTC TGTTCATATT TTCTATTTCT TCCGGGTTCA 
GTCTTGGGAG ATTGTACATG CCTAGGAATG TGTCC6TTTC TTCTAGGTTG TCCATTTTAT 
TGGACATGCA TGGGAGCACA CAGCACCGAC CAGCGAGACT CATGCTGGCT TCCTGGGGCC 
AGGCTGGGGC CCCAAGCAGC ATGGCATCCT AGAGTGTGTG AAAGCCCACT GACCCTGCCC 
AGCCCCACAA TTTCATTCTG AGAAGTGATT GCTTGCTTCT GCACTTACAG GCCCAGGATC 
TGACCTGCTT CTGAGGAGCA GGGGTTTTGG CAGGACGGGG AGATGCTGAG AGCCGACGGG 
GGTCCAGGTC CCCTCCCAGG CCCCCCTGTC TGGGGCAGCC CTTGGGAAAG ATTGCCCCAG 
TCTCCCTCCT ACAGTGGTCA GTCCCAGCTG CCCCAGGCCA GAGCTGCTTT ATTTCCGTCT 
CTCTCTCTGG ATGGTATTCT CTGGAAGCTG AAGGTTCCTG AAGTTATGAA TAGCTTTGCC 
CTGAAGGGCA TGGTTTGTGG TCACGGTTCA CAGGAACTTG GGAGACCCTG CAGCTCAGAC 
GTCCCGAGAT TGGTGGCACC CAGATTTCCT AAGCTCGCTG GGGAACAGGG CGCTTGTTTC 
TCCCTGGCTG ACCTCCCTCC TCCCTGCATC ACCCAGTTCT GAAAGCAGAG CGGTGCTGGG 
GTCACAGCCT CTCGCATCTA ACGCCGGTGT CCAAACCACC CGTGCTGGTG TTCGGGGGGC 
TACCTATGGG GAAGGGCTTC TCACTGCAGT GGTGCCCCCC GTCCCCTCTG AGATCAGAAG 
TCCCAGTCCG GACGTCAAAC AGGCCGAGCT CCCTCCAGAG GCTCCAG6GA GGGATCCTTG 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 




CCCCCCCGCT GCTGCCTCCA GCTCCTGGTG CCGCACCCTT GAGCCTGATC TTGTAGACGC 2580 

CTCAGTCTAG TCTCTGCCTC CGTGTTCACA CGCCTTCTCC CCATGTCCCC TCCGTGTCCC 2640 

CGTTTTCTCT CACAAGGACA CCGGACATTA GATTA6CCCC TGTTCCAGCC TCACCTGAAC 2700 

AGCTCACATC TGTAAAGACC TAGATTCCAA ACAAGATTCC AACCTGAAGT TCCCGGTGGA 2760 

TGTGAGTTCT GGGGCGACAT CCTTCAACCC CATCACAGCT TGCAGTTCAT CGCAAAACAT 2820 

GGAACCTGGG GTTTATCGTA AAACCCAGGT TCTTCATGAA ACACTGAGCT TCGAGGCTTG 2880 

TTGCAAGAAT TAAAGGTGCT AATACAGATC AGGGCAAGGA CTGAAGCTGG CTAAGCCTCC 2940 

TCTTTCCATC ACAGGAAAGG GGGGCCTGGG GGCGGCTGGA GGTCTGCTCC CGTGAGTGAG 3000 

CTCTTTCCTG CTACAGTCAC CAACAGTCTC TCTGGGAAGG AAACCAGAGG CCAGAGAGCA 3060 

AGCCGGAGCT AGTTTAGGAG ACCCCTGAAC CTCCACCCAA GATGCTGACC AGCCAGCGGG 3120 

CCCCCTGGAA AGACCCTACA GTTCAGGGGG GAAGAGGGGC TGACCCGCCA GGTCCCTGCT 3180 

ATCAGGAGAC ATCCCCGCTA TCAGGAGATT CCCCCACCTT GCTCCCGTTC CCCTATCCCA 3240 

ATACGCCCAC CCCACCCCTG TGATGAGCAG TTTAGTCACT TAGAATGTCA ACTGAAGGCT 3300 

TTTGCATCCC CTTTGCCAGA GGCACAAGGC ACCCACAGCC TGCTGGGTAC CGACGCCCAT 3360 

GTGGATTCAG CCAGGAGGCC TGTCCTGCAC CCTCCCTGCT CGGGCCCCCT CTGTGCTCAG 3420 

CAACACACCC AGCACCAGCA TTCCCGCTGC TCCTGAGGTC TGCAGGCAGC TCGCTGTAGC .3480 

CTGAGCGGTG TGGAGGGAAG TGTCCTGGGA GATTTAAAAT GTGAGAGGCG GGAGGTGGGA 3540 

GGTTGGGCCC TGTGGGCCTG CCCATCCCAC GTGCCTGCAT TAGCCCCAGT GCTGCTCAGC 3600 

CGTGCCCCCG CCGCAGGGGT CAGGTCACTT TCCCGTCCTG GGGTTATTAT GACTCTTGTC 3660 

ATTGCCATTG CCATTTTTGC TACCCTAACT GGGCAGCAGG TGCTTGCAGA GCCCTCGATA 3720 

CCGACCAGGT CCTCCCTCGG AGCTCGACCT GAACCCCATG TCACCCTTGC CCCAGCCTGC 3780 

AGAGGGTGGG TGACTGCAGA GATCCCTTCA CCCAAGGCCA CGGTCACATG GTTTGGAGGA 3840 



# 



74 



GCTGGTGCCC AA6GCAGAGG CCACCCTCCA GGACACACCT GTCCCCAGTG CTGGCTCTGA 3900 

CCTGTCCTTG TCTAAGAGGC TGACCCCGGA AGTGTTCCTG GCACTGGCAG CCAGCCTGGA 3960 

CCCAGAGTCC AGACACCCAC CTGTGCCCCC GCTTCTGGGG TCTACCAGGA ACCGTCTAGG 4020 

CCCAGAGGGG ACTTCCTGCT TGGCCTTGGA TGGAAGAAGG CCTCCTATTG TCCTCGTAGA 4080 

GGAAGCCACC CCGGGGCCTG AGGATGAGCC AAGTGGGATT CCGGGAACCG CGTGGCTGGG 4140 

GGCCCAGCCC GGGCTGGCTG GCCTGCATGC CTCCTGTATA AGGCCCCAAG CCTGCTGTCT 4200 

CAGCCCTCCA CTCCCTGCAG AGCTCAGAAG CACGACCCCA GGGATATCCC TGCAGCCATG 4250 

AAGTGCCTCC TGCTTGCCCT GGGCCTGGCC CTCGCCTGTG GCGTCCAGGC CATCATCGTC 4320 

ACCCAGACCA TGAAAGGCCT GGACATCCAG AAGGTTCGAG GGTTGGCCGG GTGGGTGAGT 4380 

TGCAGGGCGG GCAGGGGAGC TGGGCCTCAG AGAGCCAAGA GAGGCTGTGA CGTTGGGTTC 4440 

CCATCAGTCA GCTAGGGCCA CCTGACAAAT CCCCGCTGGG GCAGCTTCAA CCAGGCGTTC 4500 

ACTGTCTTGC ATTCTGGAGG CTGGAAGCCC AAGATCCAGG TGTTGGCAGG GCTGGCTTCT 4560 

CCTGCGGCCG CTCTCTGGGG AGCAGACGGC CGTCTTCTCC AGTCCTCTGC GCGCCCTGAT 4620 

TTCCTCTTCC TGTGAGGCCA CCAGGCCTGC TGGAAACACG CCTGCCTGCG CAGCTTCACA 4680 

CGACCTTTGT CATCTCTTTA AAGGCCATGT CTCCAGAGTC ATGTGTTGAA GTTCTGGGGG 4740 

TTAGTGGGAC ACAGTTCAGC CCCTAAAAGA GTCTCTCTGC CCCTCAAATT TTCCCCACCT 4800 

CCAGCCATGT CTCCCCAAGA TCCAAATGTT GCTACATGTG GGGGGGCTCA TCTGGGTCCC 4860 

TCTTTGGGTT CAGTGTGAGT CTGGGGAGAG CATTCCCCAG GGTGCAGAGT TGGGGGGAGT 4920 

ATCTCAGGGC TGCCCAGGCC GGGGTGGGAC AGAGAGCCCA CTGTGGGGCT GGGGGCCCCT 4980 

TCCCACCCCC AGAGTGCAAC TCAAGGTCCC TCTCCAGGTG GCGGGGACTT GGCACTCCTT 5040 

GGCTATGGC6 GCCAGCGACA TCTCCCTGCT GGATGCCCAG AGTGCCCCCC TGAGAGTGTA 5100 

CGTGGAGGAG CTGAAGCCCA CCCCCGAGGG CAACCTGGAG ATCCTGCTGC AGAAATGGTG 5160 

GGCGTCTCTC CCCAACATGG AACCCCCACT CCCCAGGGCT GTGGACCCCC CGGGGGGTGG 5220 




GGTGCAGGAG GGACCAGGGC CCCAGGGCTG 
GGCGCTCCAC CCAAGGCTGC CCACCCAGGG 
GATGCTTCAG AACATCATCA AACAAATGAA 
GGGGAGATAA AATCCTCTGA AGTGGAAATG 
ATTCTGAATT CCCTGTTAGT CTGAGGATTA 
TCATCATTTC TAGTCTGAAC ACCTCAGTAT 
GCAGTGTGGG GATAGGCCCG TGTGAAGGCT 
CTCAAGCAGT TCCCGCTACC AGCCCTGTCC 
AGCTGGATGG GTGTGGGGGC AGAGATGGGG 
GCCTGTGGTC AAGGCTCTCC CTGACCTTTT 
CATCCACCCG GTCCCCTGTG GCCTGAGGTG 
CTGGCTCCTA TGCCCATGCC ACCCCCCTCC 
CCTCAGTTCA TCCTGATGAA AATGGTCCAT 
GGGAGAACGG CGAGTGTGCT CAGAAGAAGA 
TGTTCAAGAT CGATGGTGAG TCCGGGTCCC 
GACTGTGGAC AGGTTCAGGG GGCTGGCGTC 
ATGAAGACAC TGCCTTGACA CCTGCTTCAC 
GGGGCGGTGG CCATGGGCAG GTCCCGGCTG 
CTCTCTTTGC TGGGGGGCGG GCGGTGCTCT 
GTGCCCTCCC AGGGGTAACC GAGAGCCGTT 
CCCCAGCCCG CTCCACAGCT CCTTCATCTC 
TTCACTTGTT CGTCCTAAAT CCGAGATGAT 




GGGAAGAGGG CTCAGAGTTT ACTGGTACCC 5280 

CTTTTTTTTT TTTTAAACTT TTATTAATTT 5340 

CATAAAACAT TCATTTTTGT TTACTTGGAA 5400 

CATAGCAAAG ATACATACAA TGAGGCAGGT 5460 

CAAGTGTATT TGAGCAACAG AGAGACATTT 5520 

CTAAAATGAA CAAGAAGTCC TGGAAACGAA 5580 

GCTGGGAGGC AGCAGACCTG GGTCTTCGGG 5640 

ACCTCAGACG GGGGTCAGGG TGCAGGAGAG 5700 

ACCTGAACCC CAGGGCTGCC TTTTGGGGGT 5760 

CTCTCTGGCT TCATCTGACT TCTCCTGGCC 5820 

ACAGTGAGTG CGCCGAGGCT AGTTGGCCAG 5880 

AGCCCTCCTG GGCCAGCTTC TGCCCCTGGC 5940 

GCCAATGGCT CAGAAAGCAG CTGTCTTTCA 6000 

TTATTGCAGA AAAAACCAAG ATCCCTGCGG 6060 

TGGGGGACAC CCACCACCCC CGCCCCCGGG 6120 

GGGCCCTGGG ATGCTAAGGG ACTGGTGGTG 6180 

TTGCCTCCCC TGCCACCTGC CCGGGGCCTT 5240 

GCGGGCTAAC CCACCAGGGT GACACCCGAG 6300 

GGGCCCTCAG GCTGAGCTCA GGAGGTACCT 6360 

GCCCACTCCA GGGGCCCAGG TGCCCCACGA 6420 

CTGGAGACAA ACTCTGTCCG CCCTCGCTCA 6480 

AAAGCTTCGA GGGGGGGTTG GGGTTCCATC 6540 



AGGGCTGCCC TTCCGCCGGG CAGCCTGGGC CACATCTGCC CTTGGCCCCC TCAGGACTCA 6600 

CTCTGACTGG AGGCCCTGCA CTGACTGACG CCAGGGTGCC CAGCCCAGGG TCTCTGGCGC 6660 

CATCCAGCTG CACTGGGTTT 6GGTGCTGGT CCTGCCCCCA AGCTGCCCGG ACACCACAGG 6720 

CAGCCGGGGC TGCCCACTGG CCTCGGTCAG GGTGAGCCCC AGCTGCCCCC GCTCAGGGCT 6780 

TGCCCCGACA ATGACCCCAT CCTCAGGACG CACCCCCCTT CCCTTGCTGG GCAGTGTCCA 6840 

GCCCCACCCG AGATCGGGGG AAGCCCTATT TCTTGACAAC TCCAGTCCCT GGGGGAGGGG 6900 

GCCTCAGACT GAGTGGTGAG TGTTCCCAAG TCCAGGAGGT GGTGGAGGGT CCTGGCGGAT 6960 

CCAGAGTTGA CAGTGAGGGC TTCCTGGGCC CCATGCGCCT GGCAGTG6CA GCAGGGAAGA 7020 

GGAAGCACCA TTTCAGGGGT GGGGGATGCC AGAGGCGCTC CCCACCCCGT CTTCGCCGGG 7080 

TGGTGACCCC GGGGGAGCCC CGCTGGTCGT GGAGGGTGCT GGGGGCTGAC TAGCAACCCC 7140 

TCCCCCCCCG TTGGAACTCA CTTTTCTCCC GTCTTGACCG CGTCCAGCCT TGAATGAGAA 7200 

CAAAGTCCTT GTGCTGGACA CCGACTACAA AAAGTACCTG CTCTTCTGCA TGGAAAACAG 7260 

TGCTGAGCCC GAGCAAA6CC TGGCCTGCCA GTGCCTGGGT GGGTGCCAAC CCTGGCTGCC 7320 

CAGGGAGACC AGCTGCGTGG TCCTTGCTGC AACAGGGGGT GGGGGGTGGG A6CTTGATCC 7380 

CCAGGAGGAG GAGGGGTGGG GGGTCCCTGA GTCCCGCCAG GAGAGAGTGG TCGCATACCG 7440 

GGAGCCAGTC TGCTGTGGGC CTGTGGGTGG CTGGGGACGG GGGCCAGACA CACAGGCCGG 7500 

GAGACG6GTG GGCTGCAGAA CTGTGACTGG TGTGACCGTC GCGATGGGGC CGGTGGTCAC 7560 

TGAATCTAAC AGCCTTTGTT ACCGGGGAGT TTCAATTATT TCCCAAAATA AGAACTCAGG 7620 

TACAAAGCCA TCTTTCAACT ATCACATCCT GAAAACAAAT GGCAGGTGAC ATTTTCTGTG 7680 

CCGTAGCAGT CCCACTGGGC ATTTTCAGGG CCCCTGTGCC AGGGGGGCGC GGGCATCGGC 7740 

GAGTGGAGGC TCCTGGCTGT GTCAGCCGGC CCAGGGGGAG GAAGGGACCC GGACAGCCAG 7800 

AGGTGGGGGG CAGGCTTTCC CCCTGTGACC TGCAGACCCA CTGCACTGCC CTGGGAGGAA 7860 

GGGAGGGGAA CTAGGCCAAG GGGGAAGGGC AGGTGCTCTG GAG6GCAAGG GCAGACCTGC 7920 




AGACCACCCT GGGGAGCAGG GACTGACCCC CGTCCCTGCC CCATAGTCAG GACCCCGGAG 7980 

GTGGACAACG AG6CCCTGGA GAAATTCGAC AAAGCCCTCA AGGCCCTGCC CATGCACATC 8040 

CGGCTTGCCT TCAACCCGAC CCAGCTGGA6 GGTGAGCACC CAGGCCCCGC CCTTCCCCAG 8100 

GGCAGGAGCC ACCCGGCCCC GGGACGACCT CCTCCCATGG TGACCCCCAG CTCCCCAGGC 8160 

CTCCCAGGAG GAAGGGGTGG GGTGCAGCAC CCCGTGGGGG CCCCCTCCCC ACCCCCTGCC 8220 

AGGCCTCTCT TCCCGAGGTG TCCAGTCCCA TCCTGACCCC CCCATGACTC TCCCTCCCCC 8280 

ACAGGGCAGT GCCACGTCTA GGTGAGCCCC TGCCGGTGCC TCTGGGGTAA GCTGCCTGCC 8340 

CTGCCCCACG TCCTGGGCAC ACACATGGGG TAGGGGGTCT TGGTGGGGCC TGGGACCCCA 8400 

CATCAGGCCC TGGGGTCCCC CCTGTGAGAA TGGCTGGAAG CTGGGGTCCC TCCTGGCGAC 8450 

TGCAGAGCTG 6CTGGCCGCG TGCCACTCTT GTGGGTGACC TGTGTCCTGG CCTCACACAC 8520 

TGACCTCCTC CAGCTCCTTC CAGCAGAGCT AAGGCTAAGT 6AGCCAGAAT GGTACCTAAG 8580 

GGGAGGCTAG CGGTCCTTCT CCCGAGGAGG GGCTGTCCTG GAACCACCAG CCATGGAGAG 8640 

GCTGGCAAGG GTCTGGCAGG TGCCCCAGGA ATCACAGGGG GGCCCCATGT CCATTTCAGG 8700 

GCCCGGGAGC CTTGGACTCC TCTGGGGACA GACGACGTCA CCACCGCCCC CCCCCCATCA 8760 

GGGGGACTAG AAGGGACCAG GACTGCAGTC ACCCTTCCTG GGACCCAGGC CCCTCCAGGC 8820 

CCCTCCTGGG GCTCCTGCTC TGGGCAGCTT CTCCTTCACC AATAAAGGCA TAAACCTGTG 8880 

CTCTCCCTTC TGAGTCTTTG CTGGACGACG GGCAGGGGGT GGAGAAGTGG TGGGGAGGGA 8940 

GTCTGGCTCA GAGGATGACA GCGGGGCTGG GATCCAGGGC GTCTGCATCA CAGTCTTGTG 9000 

ACAACTGGGG GCCCACACAC ATCACTGCGG CTCTTTGAAA CTTTCAGGAA CCAGGGAGGG 9060 

ACTCGGCAGA GACATCTGCC AGTTCACTTG GAGTGTTCAG TCAACACCCA AACTCGACAA 9120 

AGGACAGAAA GTGGAAAATG GCTGTCTCTT AGTCTAATAA ATATTGATAT GAAACTCAAG 9180 

TTGCTCAT6G ATCAATATGC CTTTATGATC CAGCCAGCCA CTACTGTCGT ATCAACTCAT 9240 
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GTACCCAAAC GCACTGATCT GTCTGGCTAA TGATGAGAGA TTCCCAGTAG AGAGCTGGCA 9300 

AGAGGTCACA GTGAGAACTG TCTGCACACA CAGCAGAGTC CACCAGTCAT CCTAAGGAGA 9360 

TCAGTCCTGG TGTTCATTGG AGGACTGATG TTGAAGCTGA AACTCCAATG CTTTGGCCAC 9420 

CTGATGTGAA GAGCTGACTC ATTTGAAAAG ACCCTGATGC TGGGAAAGAT TGAGGGCAGG 9480 

AGGAGAAGGG GACGACAGAG GATGAGATGG TTGGATGGCA TCACCAACAC AATGGACATG 9540 

GGTTTGGGTG GACTCCAGGA GTTGGTGATG GACAGGGAGG CCTGGCGTGC TACGGAAGCG 9600 

GTTTATGGGG TCACAAAGAC TGAGTGACTG AACTGAGCTG AACTGAATGG AAATGAGGTA 9560 

TACAGCAAAG TGGGGATTTT TTAGATAATA AGAATATACA CATAACATAG TGTATACTCA 9720 

TATTTTTATG CATACCTGAA TGCTCAGTCA CTCAGTCGTA TCTGACTCTG TGACCTATGG 9780 

ACCGTAGCCT TCCAGGTTTC TTCTGTCCAC AGAATTCTCC AAGGCAAGAA TACTGGAGTG 9840 

GGTAGCCATT TCCTCCTCCA GGGGATCCTC CCGACCCAGG GATTGAACCG GCATCTCCTG 9900 

TATTGGCAGG TGGATTCTTT ACCACTGTGC CACCAGGGAA GCCCGTGTTA CTCTCTATGT 9960 

CCCACTTAAT TACCAAAGCT GCTCCAAGAA AAAGCCCCTG TGCCCTCTGA GCTTCCCGGC 10020 

CTGCAGAGGG TGGTGGGGGT AGACTGTGAC CTGGGAACAC CCTCCCGCTT CAGGACTCCC 10080 

GGGCCACGTG ACCCACAGTC CTGCAGACAG CCGGGTAGCT CTGCTCTTCA AGGCTCATTA 10140 

TCTTTAAAAA AAACTGAGGT CTATTTTGTG ACTTCGCT6C CGTAACTTCT GAACATCCAG 10200 

TGCGATGGAC AGGACCTCCT CCCCAGGCCT CAGGGGCTTC AGGGAGCCAG CCTTCACCTA 10260 

TGAGTCACCA GACACTCGGG GGTG6CCCCG CCTTCA6GGT GCTCACAGTC TTCCCATCGT 10320 

CCTGATCAAA GAGCAAGACC AATGACTTCT TAGGAGCAAG CAGACACCCA CAGGACACTG 10380 

AGGTTCACCA GAGCTGAGCT GTCCTTTTGA ACCTAAAGAC ACACAGCTCT CGAAGGTTTT 10440 

CTCTTTAATC TGGATTTAAG GCCTACTTGC CCCTCAAGAG GGAAGACAGT CCTGCATGTC 10500 
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CCCAGGACAG CCACTCGGTG GCATCCGAGG CCACTTAGTA TTATCTGACC GCACCCTGGA 10550 

ATTAATCGGT CCAAACTGGA CAAAAACCTT GGTGGGAAGT TTCATCCCAG AGGCCTCAAC 10620 

CATCCTGCTT TGACCACCCT GCATCTTTTT TTCTTTTATG TGTATGCATG TATATATATA 10680 

TATATATTTT TTTTTTTTTC ATTTTTTGGC TGTGCTGGCT GTTCGTTGCA GTTCGGTGCG 10740 

CA6GCTTCTC TCTAGTTTCT CTCTAGTCTT CTCTTATCAC AGAGCAGTCT CTAGACGATC 10800 

6ACGCGT 10807 
(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
AATTCCGATC GACGCGTCGA CGATATACTC TAGACGATCG ACGCGTA 47 
(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: BLGAMP3 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
TGGATCCCCT GCCGGTGCCT CTGG 



24 




(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: BL6AMP4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AACGCGTCAT CCTCTGTGAG CCAG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6839 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11 
ACTACGTAGT 

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6632 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGACGCGGAT CCTACGTACC TGCAGCCATG TTTTCCATGA GG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6627 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
AGGGCTTCGG CAAGCTTCAG G 
(2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6521 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 



GCCAAAGACT TACTTCCCTC TAGA 




(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC5520 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCATGAACGT CGCGTGGTGG TTGTGCTACC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6519 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
ACCACGCGAC GTTCATGCTC TAAAACCGTT 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6518 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCTGCGGGAT CCTACGTACT AGGGGGACAG GGAAGG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6629 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGACGCGAAT TCTACGTACC TGCAGCCATG AAAAGGATGG TTTCT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6530 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGACGCGAAT TCTACGTACC TGCAGCCATG AAACATCTAT TATTG 




(2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6625 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20 
GTGA6ATTTT CAGATCTTGT C 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6626 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21 
AAGAATTACT GTGGCCTACC A 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GCTGCGGAAT TCTACGTACT ATTGCTGTGG GAA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6514 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 
CGACGCGGAT CCTACGTACC TGCAGCCATG AGTTGGTCCT TGCAC 45 
(2) INFORMATION FOR SEQ ID N0:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Tinear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: zc6517 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 
GTCTCTGGTA GCAACATACT A 21 




(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: zc6516 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25 
GGGTTTCTAG CCCTACTAGT AG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: zc6515 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26 



GGGTTTCTAG CCCTACTAGT AG 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

AAGCTACGCG TCGATCGTCT AGAGTATATC GTCGACGCGT CGATCGG 



